
Cancer Cell Detection and Classification from Digital Whole Slide Image

Authors: Anil B. Gavade, Rajendra B. Nerli, Shridhar Ghagane, Priyanka A. Gavade, and Venkata Siva Prasad Bhagavatula
KAHER's Dr. Prabhakar Kore Basic Science Research Centre, KLE Academy of Higher Education and Research

Abstract

The World Health Organisation has identified cancer as one of the foremost causes of death globally, reporting that nearly one in six deaths is due to cancer. Hence, an early and correct diagnosis is required to assist doctors in selecting the most accurate and effective treatment option for the patient. Pathological data carry rich tumour information that can be used to diagnose cancer. Digitizing pathological data into images and analysing them with deep learning applications will be a significant contribution to clinical testing. Advances in technology now allow artificial intelligence (AI) and digital pathology to be combined, enabling image-based diagnosis. This study uses a residual network (ResNet-50), a convolutional neural network (CNN) pre-trained on the ImageNet dataset, to train on and categorise lung histopathology images into non-cancerous, lung adenocarcinoma, and lung squamous cell carcinoma, delivering an accuracy of 98.9%. Experimental results show that the ResNet-50 model delivers finer classification results than state-of-the-art methods.
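A minimal sketch of the transfer-learning setup described in the abstract, assuming PyTorch/torchvision (an illustration only, not the chapter's implementation; the histopathology data pipeline is omitted):

import torch
import torch.nn as nn
from torchvision import models

# Load ResNet-50 pre-trained on ImageNet and replace the final
# fully connected layer with a three-class head: non-cancerous,
# lung adenocarcinoma, and lung squamous cell carcinoma.
model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)
model.fc = nn.Linear(model.fc.in_features, 3)

criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

def train_step(images, labels):
    # One fine-tuning step on a batch of histopathology image tiles.
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
    return loss.item()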
Lecture Notes in Networks and Systems 558
Kingsley A. Ogudo · Sanjoy Kumar Saha · Debnath Bhattacharyya, Editors
Smart Technologies in Data Science and Communication
Proceedings of SMART-DSC 2022
Lecture Notes in Networks and Systems
Volume 558
Series Editor
Janusz Kacprzyk, Systems Research Institute, Polish Academy of Sciences,
Warsaw, Poland
Advisory Editors
Fernando Gomide, Department of Computer Engineering and Automation—DCA,
School of Electrical and Computer Engineering—FEEC, University of
Campinas—UNICAMP, São Paulo, Brazil
Okyay Kaynak, Department of Electrical and Electronic Engineering,
Bogazici University, Istanbul, Turkey
Derong Liu, Department of Electrical and Computer Engineering, University of
Illinois at Chicago, Chicago, USA
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Witold Pedrycz, Department of Electrical and Computer Engineering, University of
Alberta, Alberta, Canada
Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland
Marios M. Polycarpou, Department of Electrical and Computer Engineering,
KIOS Research Center for Intelligent Systems and Networks, University of Cyprus,
Nicosia, Cyprus
Imre J. Rudas, Óbuda University, Budapest, Hungary
Jun Wang, Department of Computer Science, City University of Hong Kong,
Kowloon, Hong Kong
The series “Lecture Notes in Networks and Systems” publishes the latest
developments in Networks and Systems—quickly, informally and with high quality.
Original research reported in proceedings and post-proceedings represents the core
of LNNS.
Volumes published in LNNS embrace all aspects and subfields of, as well as new
challenges in, Networks and Systems.
The series contains proceedings and edited volumes in systems and networks,
spanning the areas of Cyber-Physical Systems, Autonomous Systems, Sensor
Networks, Control Systems, Energy Systems, Automotive Systems, Biological
Systems, Vehicular Networking and Connected Vehicles, Aerospace Systems,
Automation, Manufacturing, Smart Grids, Nonlinear Systems, Power Systems,
Robotics, Social Systems, Economic Systems, and others. Of particular value to
both the contributors and the readership are the short publication timeframe and
the world-wide distribution and exposure which enable both a wide and rapid
dissemination of research output.
The series covers the theory, applications, and perspectives on the state of the art
and future developments relevant to systems and networks, decision making, control,
complex processes and related areas, as embedded in the fields of interdisciplinary
and applied sciences, engineering, computer science, physics, economics, social, and
life sciences, as well as the paradigms and methodologies behind them.
Indexed by SCOPUS, INSPEC, WTI Frankfurt eG, zbMATH, SCImago.
All books published in the series are submitted for consideration in Web of Science.
For proposals from Asia please contact Aninda Bose (aninda.bose@springer.com).
Kingsley A. Ogudo · Sanjoy Kumar Saha ·
Debnath Bhattacharyya
Editors
Smart Technologies in Data
Science and Communication
Proceedings of SMART-DSC 2022
Editors
Kingsley A. Ogudo
University of Johannesburg
Johannesburg, South Africa
Debnath Bhattacharyya
Department of Computer Science
and Engineering
Koneru Lakshmaiah Education Foundation
Guntur, India
Sanjoy Kumar Saha
Department of Computer Science
and Engineering
Jadavpur University
Kolkata, West Bengal, India
ISSN 2367-3370 ISSN 2367-3389 (electronic)
Lecture Notes in Networks and Systems
ISBN 978-981-19-6879-2 ISBN 978-981-19-6880-8 (eBook)
https://doi.org/10.1007/978-981-19-6880-8
© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature
Singapore Pte Ltd. 2023
This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether
the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse
of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and
transmission or information storage and retrieval, electronic adaptation, computer software, or by similar
or dissimilar methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication
does not imply, even in the absence of a specific statement, that such names are exempt from the relevant
protective laws and regulations and therefore free for general use.
The publisher, the authors, and the editors are safe to assume that the advice and information in this book
are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or
the editors give a warranty, expressed or implied, with respect to the material contained herein or for any
errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional
claims in published maps and institutional affiliations.
This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd.
The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721,
Singapore
Conference Committee Members
Organizing Committee
General Chairs
Philippe Fournier-Viger, Shenzhen University, Guangdong, China
T. Pavan Kumar, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur,
Andhra Pradesh, India
Advisory Board
Yu-Chen Hu, Providence University, Taichung City, Taiwan
Zhihan Lv, Uppsala University, Sweden
Osvaldo Gervasi, University of Perugia, Italy
Paul S. Pang, Unitec Institute of Technology, New Zealand
Andrzej Goscinski, Deakin University, Australia
Jason Levy, University of Hawaii, Hawaii, USA
Tai-hoon Kim, Konkuk University, South Korea
Sabah Mohammed, Lakehead University, Ontario, Canada
Jinan Fiaidhi, Lakehead University, Ontario, Canada
Y. Byun, Jeju National University, South Korea
Amiya Bhaumick, LUC KL, Malaysia
Dipti Prasad Mukherjee, ISI Kolkata, India
Sanjoy Kumar Saha, Jadavpur University, Kolkata, India
Sekhar Verma, IIIT Allahabad, India
Pabitra Mitra, IIT Kharagpur, India
Joydeep Chandra, IIT Patna, India
G. Yvette, Iloilo Science and Technology University, Philippines
Rosslin Robles, University of San Agustin, Philippines
Hamed Daie Kasmaie, Islamic Azad University, Tehran, Iran
Alexey Seteikin, Immanuel Kant Baltic Federal University, Russia
Prabhat Mahanti, University of New Brunswick, Canada
Richard G. Bush, Baker College, Michigan, USA
Sayan K. Ray, Manukau Institute of Technology, Manukau, New Zealand
Soocheol Kim, Daegu Catholic University, South Korea
N. Thirupathi Rao, Vignan’s Institute of Information Technology, India
Mohammed Usman, King Khalid University, Abha, Saudi Arabia
Oscar Cordon, Digital University, University of Granada, Spain
Tarek Sobh, University of Bridgeport, Connecticut, USA
Xiao-Zhi Gao, University of Eastern Finland, Finland
Tseren-Onolt Ishdorj, Mongolian University of Science and Technology, Mongolia
Khuder Altangerel, Mongolian University of Science and Technology, Mongolia
Randy Santocildes Tolentino, Hanseo University, South Korea
Diego Galar, Lulea University of Technology, Sweden
Divya Midhunchakkaravarthy, Lincoln University College, Malaysia
C. V. Jawhar, IIIT, Hyderabad, India
Alexander Gelbukh, National Polytechnic Institute, Mexico
Saptarshi Das, Pennsylvania State University, USA
Sourav Sen Gupta, NTU, Singapore
Roumen Koumtchev, Technical University of Sofia, Bulgaria
Christopher Lazarus, Tunku Abdul Rahman University College, Malaysia
Fatiha Merazka, University of Science and Technology Houari Boumediene, Algeria
JinAn Xu, Beijing Jiaotong University, China
Sebastian Ventura Soto, Universidad de Cordoba, Spain
Koneru Satyanarayana, KLEF, Guntur, India
K. Siva Kanchana Latha, KLEF, Guntur, India
Koneru Lakshman Havish, KLEF, Guntur, India
Koneru Raja Hareen, KLEF, Guntur, India
K. S. Jagannatha Rao, KLEF, Guntur, India
G. Pardha Saradhi Varma, KLEF, Guntur, India
N. Venkatram, KLEF, Guntur, India
Megha Bhushan, DIT University, Dehradun, India
Editorial Board
Debnath Bhattacharyya, KL University, Guntur, India
Kingsley A. Ogudo, University of Johannesburg, South Africa
Sanjoy Kumar Saha, Jadavpur University, Kolkata
Programme Chairs
Pelin Angin, Middle East Technical University, Turkey
S. Sagar Imambi, K. L. Deemed to be University, Guntur, India
Management Co-chairs
Sonia Djebali, ESILV—Ecole Supérieure d’Ingénieurs Léonard de Vinci, Paris,
France
V. Srikanth, KL University, Guntur, India
Publicity Committee
K. Ravindranath, KL University, Guntur, India
P. Vidya Sagar, KL University, Guntur, India
M. Nageswara Rao, KL University, Guntur, India
Venkata Naresh Mandhala, KL University, Guntur, India
Finance Committee
Venkata Naresh Mandhala, KL University, Guntur, India
Local Arrangements Committee
P. S. V. S. Sridhar, KL University, Guntur, India
K. V. Raju, KL University, Guntur, India
K. Swarna, KL University, Guntur, India
Technical Programme Committee
Sanjoy Kumar Saha, Professor, Jadavpur University, Kolkata
Hans Werner, Associate Professor, University of Munich, Munich, Germany
Goutam Saha, Scientist, CDAC, Kolkata, India
Samir Kumar Bandyopadhyay, Professor, University of Calcutta, India
Ronnie D. Caytiles, Associate Professor, Hannam University, Republic of Korea
Y. Byun, Professor, Jeju National University, Jeju Island, Republic of Korea
Alhad Kuwadekar, Professor, University of South Wales, UK
Debasri Chakraborty, Asst. Professor, BIET, Suri, West Bengal, India
Poulami Das, Assistant Professor, Heritage Institute of Technology, Kolkata, India
Indra Kanta Maitra, Associate Professor, St. Xavier’s University, Kolkata, India
Divya Midhun Chakravarty, Professor, LUC, KL, Malaysia
F. C. Morabito, Professor, Mediterranea University of Reggio Calabria, Reggio
Calabria RC, Italy
Hamed Kasmaei, Islamic Azad University, Tehran, Iran
Nancy A. Bonner, University of Mary Hardin-Baylor, Belton, TX 76513, USA
Alfonsas Misevicius, Professor, Kaunas University of Technology, Lithuania
Ratul Bhattacharjee, AxiomSL, Singapore
Lunjin Lu, Professor, Computer Science and Engineering, Oakland University,
Rochester, MI 48309-4401, USA
Ajay Deshpande, CTO, Rakya Technologies, Pune, India
Debapriya Hazra, Jeju National University, Jeju Island, South Korea
Alexandra Branzan Albu, University of Victoria, Victoria, Canada
G. Yvette, Iloilo Science and Technology University, Philippines
M. H. M. Krishna Prasad, Professor, UCEK, JNTUK, Kakinada, India
N. Thirupathi Rao, Associate Professor, Vignan's Institute of Information Technology, Visakhapatnam-530049, India
P. Kishore Kumar, Associate Professor, Vignan's Institute of Information Technology, Visakhapatnam-530049, India
Joydeep Chandra, Assistant Professor, IIT Patna, Patna, India
A. Maiti, Assistant Professor, IIT Patna, Patna, India
Jianbang Du, Texas Southern University, USA
Richard G., Bush, Baker College, Michigan, USA
Sourav Sen Gupta, NTU, Singapore
Rosslin J. Robles, Associate Professor, University of San Agustin, Philippines
Jason Levy, University of Hawaii, USA
Daniel Ruiz Fernandez, University of Alicante, Spain
Christo El Morr, York University, Canada
Sayan K. Ray, Manukau Institute of Technology, Manukau, New Zealand
Rinkle Rani, Thapar University, India
G. Neelima, Associate Professor, VIIT, Visakhapatnam, India
Kalpdrum Passi, Laurentian University, Canada
Wafa Shafqat, Jeju National University, South Korea
Alexey Seteikin, Immanuel Kant Baltic Federal University, Russia
Zhang Yu, Harbin University of Commerce, China
Arindam Biswas, Kazi Nazrul University, West Bengal, India
Preface
Knowledge in the engineering sciences is about sharing our research ideas with others, and a conference is the best way to propose a research idea, present its future scope, and add energy towards building a strong and innovative future. So, here we offer a small support from our side for conferring your ideas through the "International Conference on Smart Technologies in Data Science and Communication (SMART-DSC 2022)", covering electrical and electronics engineering, information technology, and computer science. The conference is not confined to a specific topic or region; you can exhibit your ideas in similar, mixed, or related technologies bloomed from anywhere around the world, because "an idea can change the future, and its implementation can build it". KLEF Deemed to be University is a great platform for carrying your idea(s) into the world, and we give our best in every related aspect. Our environment leads you along a path for your idea, our people build your confidence, and our intention is to make intelligence in engineering fly higher and higher. You can trust us with your confidentiality: our review process is double-blind through EasyChair.
At last, we pay the highest regard to the Koneru Lakshmaiah Education Foundation, K. L. Deemed to be University, Guntur and Hyderabad, for extending support for the financial management of the 5th SMART-DSC 2022.
Debnath Bhattacharyya, Guntur, India
Kingsley A. Ogudo, Johannesburg, South Africa
Sanjoy Kumar Saha, Kolkata, India
Acknowledgements
The editors wish to extend heartfelt acknowledgement to all contributing authors, the esteemed reviewers for their timely responses, and the members of the various organizing committees and production staff whose diligent work gave shape to the 5th SMART-DSC 2022 proceedings. We especially thank our dedicated reviewers for volunteering to check the manuscripts thoroughly, maintaining technical quality, and offering useful suggestions.
We thank all the following invited speakers who extended their support by sharing
knowledge in their area of expertise.
Prof. Philippe Fournier-Viger, Harbin Institute of Technology (Shenzhen), Shen-
zhen, Guangdong, China.
Prof. Jose L. Seño, Chair, Computer Science Department, College of Information
and Computing Sciences, University of Santo Tomas, Philippines.
Dr. Khuder Altangerel, Head of Computer Science Department, School of
Information, Communication Technology, Mongolian University of Science and
Technology, Ulaanbaatar, Mongolia 14191.
Dr. Shumaila Javaid, Shanghai Research Institute for Intelligent Autonomous
Systems, Tongji University, Shanghai, China.
Dr. Djebali Sonia, ESILV—Ecole Supérieure d’Ingénieurs Léonard de Vinci,
Paris, France.
Dr. Pelin Angin, Middle East Technical University, Ankara, Turkey.
Divya Midhunchakkaravarthy, Lincoln University College, Faculty of Computer
Science and Multimedia, Selangor, Malaysia.
Dr. Snehanshu Pal, National Institute of Technology Rourkela, Rourkela, Odisha,
India.
Debnath Bhattacharyya
Kingsley A. Ogudo
Sanjoy Kumar Saha
Contents
A Graph-Based Model for Discovering Host-Based Hook Attacks
P. Pandiaraja, K. Muthumanickam, and R. Palani Kumar
E-Health Care Patient Information Retrieval and Monitoring System Using SVM
K. Sumathi and P. Pandiaraja
Number Plate Recognition Using Optical Character Recognition (OCR) and Connected Component Analysis (CCA)
Puppala Ramya, Tummala Haswanth Chowdary, Pisupati Krishna Teja, and Tadepally Hrushikesh
Cartoonify an Image with OpenCV Using Python
Puppala Ramya, Penki Ganesh, Kopanathi Mouli, and Vutla Naga Sai Akhil
Web Design as an Important Factor in the Success of a Website
Puppala Ramya, K. Jai Sai Chaitanya, S. K. Fardeen, and G. Prabhakar
Earlier Selection of Routes for Data Transfer in Both Wired and Wireless Networks
S. NagaMallik Raj, S. Neeraja, N. Thirupathi Rao, and Debnath Bhattacharyya
Identifying River Drainage Characteristics by Deep Neural Network
Vithya Ganesan, Tejaswi Talluru, Manoj Challapalli, and Chandana Seelam
A Review on Optimal Deep Learning Based Prediction Model for Multi Disease Prediction
Aneel Kumar Minda and Vithya Ganesan
A Hybrid Multi-user Based Data Replication and Access Control Mechanism for Cloud Data Security
V. Devi Satya Sri and Srikanth Vemuru
Leveraging the Goldfinger Attack in Blockchain Based on the Topological Properties
Arcel Kalenga Muteba and Kingsley A. Ogudo
Bitcoin Transaction Computational Efficiency and Network Node Power Consumption Prediction Using an Artificial Neural Network
Arcel Kalenga Muteba, Kingsley A. Ogudo, and Espoir M. M. Bondo
Remote Breast Cancer Patient Monitoring System: An Extensive Review
Sangeeta Parshionikar and Debnath Bhattacharyya
Simplifying the Code Editor Using MEAN Stack Technologies
S. NagaMallik Raj, M. Jyothsna, P. Srinu, S. Karthik, K. Gnana Jeevana, N. Thirupathi Rao, and Debnath Bhattacharyya
Prediction and Identification of Diseases to the Crops Using Machine Learning
S. NagaMallik Raj, Pyla Lohit, Doddala Jyotheendra, Kannuru Chandana, P. Nikhil, N. Thirupathi Rao, and Debnath Bhattacharyya
Pulse-Based Smart Electricity Meter Using Raspberry Pi and MEFN
Eswar Abisheak Tadiparthi, Majji Prasanna Kumari, Basanaboyana Vamsi Sai, Kollana Bharat Kalyan, B. Dinesh Reddy, N. Thirupathi Rao, and Debnath Bhattacharyya
Brain Tumor Segmentation Using U-Net
Paturi Jyothsna, Mamidi Sai Sri Venkata Spandhana, Rayi Jayasri, Nirujogi Venkata Sai Sandeep, K. Swathi, N. Marline Joys Kumari, N. Thirupathi Rao, and Debnath Bhattacharyya
An Empirical Study of CNN-Deep Learning Models for Detection of Covid-19 Using Chest X-Ray Images
Mohd. Abdul Muqeet, Quazi Mateenuddin Hameeduddin, B. Mohammed Ismail, Ali Baig Mohammad, Shaik Qadeer, and M. Muzammil Parvez
Detection of Eye Blink Using SVM Classifier
Varaha Sai Adireddi, Charan Naga Santhu Jagadeesh Boddeda, Devi Shanthisree Kumpatla, Chris Daniel Mantri, B. Dinesh Reddy, G. Geetha, N. Thirupathi Rao, and Debnath Bhattacharyya
A Novel Approach for Health Analysis Using Machine Learning Approaches
Debdatta Bhattacharya, N. Thirupathi Rao, K. Asish Vardhan, and Eali Stephen Neal Joshua
Classification of Healthy and Diseased Lungs by Pneumonia Using X-Rays and Gene Sequencing with Deep Learning Approaches
Debdatta Bhattacharya, K. V. Satyanarayana, N. Thirupathi Rao, and Eali Stephen Neal Joshua
Breast Cancer Classification Using Improved Fuzzy C-Means Algorithm
N. Thirupathi Rao, K. V. Satyanarayana, M. Satyanarayana, Eali Stephen Neal Joshua, and Debnath Bhattacharyya
Repercussions of Incorporating Filters in CNN Model to Boost the Diagnostic Ability of SARS-CoV-2 Virus Using Chest Computed Tomography Scans
Dhiren Dommeti, Siva Rama Krishna Nallapati, P. V. V. S. Srinivas, and Venkata Naresh Mandhala
Software Development Estimation Cost Using ANN
Puppala Ramya, M. Sai Mokshith, M. Abdul Rahman, and N. Nithin Sai
A Generic Flow of Cyber-Physical Systems—A Comprehensive Survey
Jampani Satish Babu, Gonuguntla Krishna Mohan, and N. Praveena
Mental Disorder Detection in Social Networks Using SVM Classification: An Improvised Approach
B. Dinesh Reddy, Eali Stephen Neal Joshua, N. Thirupathi Rao, and Debnath Bhattacharyya
An Enhanced K-Means Clustering Algorithm to Improve the Accuracy of Clustering Using Centroid Identification Based on Compactness Factor
Eali Stephen Neal Joshua, K. Asish Vardhan, N. Thirupathi Rao, and Debnath Bhattacharyya
Prediction of Chronic Kidney Disease with Various Machine Learning Techniques: A Comparative Study
K. Swathi and G. Vamsi Krishna
Blockchain and Its Idiosyncratic Effects on Energy Consumption and Conservation
K. Mrudula Devi, D. Surya Sai, N. Thirupathi Rao, K. Swathi, and Swathi Voddi
Smart Hydroponics System for Soilless Farming Based on Internet of Things
G. V. Danush Ranganath, R. Hari Sri Rameasvar, and A. Karthikeyan
Solution Approach for Detection of Stock Price Manipulation by Market Operators
Yogesh Kakde, Ganesh Chavan, Basant Sah, and Apoorva Sen
Cancer Cell Detection and Classification from Digital Whole Slide Image
Anil B. Gavade, Rajendra B. Nerli, Shridhar Ghagane, Priyanka A. Gavade, and Venkata Siva Prasad Bhagavatula
Author Index
Editors and Contributors
About the Editors
Prof. (Dr.) Kingsley A. Ogudo, Ph.D. received the Master's in Electrical/Electronics and Telecommunication Engineering and the Doctoral degree in Electrical and Electronics Engineering Technology from the Tshwane University of Technology (TUT), South Africa, in 2010 and 2016, respectively. He received his Ph.D. in Electronics and Optoelectronics Systems from the University of Paris-Est, France, in July 2018. His research interests include electronic and optoelectronic devices, power electronics, system integration of devices based on renewable energy management sources, telecommunication engineering, high-frequency electronics, AI, IoT and data analytics, physics, and applied mathematics. He is a Professional Engineering Technologist certified by ECSA and a member of the IEEE Society. He is a Fellow of the SAIEE and Secretary General of the SAIEE Entrepreneurship and Innovation Chapter. He has lectured at three different universities (Tshwane University of Technology, UNISA, and UJ) for the past 11 years and has taught various electrical and electronics engineering modules (subjects) to both undergraduate and postgraduate students. He has published over 65 international ISI journal articles and international conference papers. He is currently an Associate Professor/Researcher at the Department of Electrical and Electronics Engineering Technology, University of Johannesburg (UJ), South Africa.
Dr. Sanjoy Kumar Saha is currently a Professor in the Department of Computer Science and Engineering, Jadavpur University, Kolkata, India. He did his B.E. and M.E. at Jadavpur University and completed his Ph.D. at IIEST Shibpur, West Bengal, India. His research interests include image, video, and audio data processing, physiological sensor signal processing, and data analytics. He has published more than a hundred articles in various international journals and conferences of repute. He has guided eleven Ph.D. students and holds four US patents. Dr. Saha is a member of the IEEE Computer Society, the Indian Unit for Pattern Recognition and Artificial Intelligence, and the ACM. He has served TCS Innovation Lab, Kolkata, India, as an advisor for the signal processing group.
Prof. (Dr.) Debnath Bhattacharyya is a Professor in the Computer Science and Engineering Department, Koneru Lakshmaiah Education Foundation (known as K. L. Deemed to be University), Guntur, Andhra Pradesh, India. Dr. Bhattacharyya is presently an Invited/Visiting International Professor at Lincoln University College, KL, Malaysia, and the University of Johannesburg, South Africa. He received his Ph.D. (Tech., Computer Science and Engineering) from the University of Calcutta, Kolkata, India. He is a Senior Member of IEEE, a member of ACM, and a Life Member of CSI, India. He is an editor of many international journals (indexed by Scopus, SCI, and Web of Science) and has published 234 Scopus-indexed papers and 145 Web of Science papers. His research interests include security engineering, pattern recognition, biometric authentication, multimodal biometric authentication, data mining, and image processing.
Contributors
Abdul Rahman M. Department of Computer Science and Engineering, Koneru
Lakshmaiah Education Foundation, Vaddeswaram, India
Adireddi Varaha Sai Department of Computer Science and Engineering, Vignan’s
Institute of Information Technology (A), Visakhapatnam, Andhra Pradesh, India
Akhil Vutla Naga Sai Department of Computer Science and Engineering, Koneru
Lakshmaiah Education Foundation, Guntur, India
Asish Vardhan K. Department of Computer Science and Engineering, Bullayya
College of Engineering for Women, Visakhapatnam, Andhra Pradesh, India
Babu Jampani Satish Department of Computer Science and Engineering, Koneru
Lakshmaiah Education Foundation, Vaddeswaram, Guntur, A.P, India
Bhagavatula Venkata Siva Prasad Medtronic, Hyderabad, India
Bhattacharya Debdatta Department of Computer Science and Engineering,
Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, Andhra Pradesh,
India
Bhattacharyya Debnath Department of Computer Science and Engineering,
Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, Andhra Pradesh,
India
Boddeda Charan Naga Santhu Jagadeesh Department of Computer Science and
Engineering, Vignan’s Institute of Information Technology (A), Visakhapatnam,
Andhra Pradesh, India
Bondo Espoir M. M. Engineering Research and Development BOND’AF, Paris,
France
Challapalli Manoj CSE, Koneru Lakshmaiah Education Foundation, Guntur,
Andhra Pradesh, India
Chandana Kannuru Department of CSE, Vignan’s Institute of Information Tech-
nology (A), Duvvada, Visakhapatnam, India
Chavan Ganesh KL University, Guntur, AP, India
Chowdary Tummala Haswanth Department of Computer Science and Engi-
neering, Koneru Lakshmaiah Education Foundation, Guntur, India
Danush Ranganath G. V. School of Electrical Engineering, Vellore Institute of
Technology, Vellore, India
Devi Satya Sri V. Department of Computer Science and Engineering, Koneru
Lakshmaiah Education Foundation, Vaddeswaram, Guntur District, A.P, India
Devi K. Mrudula Department of Mathematics, Vignan’s Institute of Information
Technology (A), Visakhapatnam, AP, India
Dinesh Reddy B. Department of Computer Science and Engineering, Vignan’s
Institute of Information Technology, Visakhapatnam, Andhra Pradesh, India
Dommeti Dhiren Department of Computer Science Engineering, Koneru Laksh-
maiah Education Foundation, Vaddeswaram, Andhra Pradesh, India
Fardeen S. K. Department of Computer Science and Engineering, Koneru Laksh-
maiah Education Foundation, Guntur, India
Ganesan Vithya CSE, Koneru Lakshmaiah Education Foundation, Guntur, Andhra
Pradesh, India
Ganesh Penki Department of Computer Science and Engineering, Koneru Laksh-
maiah Education Foundation, Guntur, India
Gavade Anil B. Department of E&C, KLS Gogte Institute of Technology, Belagavi,
Karnataka, India
Gavade Priyanka A. Department of Computer Science and Engineering, KLE
Tech University Dr. M. S. Sheshgiri College of Engineering and Technology,
Belagavi, Karnataka, India
Geetha G. Department of Information Technology, VR Siddhartha Engineering
College, Kanuru, Vijayawada, Andhra Pradesh, India
Ghagane Shridhar Department of Biotechnology, KAHER’s Dr. Prabhakar Kore
Basic Science Research Center, V. K. Institute of Dental Sciences Campus, Belagavi,
Karnataka, India
Hameeduddin Quazi Mateenuddin Faculty of Electronics and Communication
Engineering, Indian Naval Academy, Ezhimala, Kerala, India
Hari Sri Rameasvar R. School of Electrical Engineering, Vellore Institute of
Technology, Vellore, India
Hrushikesh Tadepally Department of Computer Science and Engineering, Koneru
Lakshmaiah Education Foundation, Guntur, India
Jai Sai Chaitanya K. Department of Computer Science and Engineering, Koneru
Lakshmaiah Education Foundation, Guntur, India
Jayasri Rayi Department of Computer Science & Engineering, Vignan’s Institute
of Information Technology (A), Visakhapatnam, AP, India
Jeevana K. Gnana Department of CSE, Vignan’s Institute of Information Tech-
nology (A), Duvvada, Visakhapatnam, India
Joshua Eali Stephen Neal Department of Computer Science and Engineering,
Vignan’s Institute of Information Technology, Visakhapatnam, Andhra Pradesh,
India
Jyotheendra Doddala Department of CSE, Vignan's Institute of Information
Technology (A), Duvvada, Visakhapatnam, India
Jyothsna M. Department of CSE, Vignan’s Institute of Information Technology
(A), Duvvada, Visakhapatnam, India
Jyothsna Paturi Department of Computer Science & Engineering, Vignan’s Insti-
tute of Information Technology (A), Visakhapatnam, AP, India
Kakde Yogesh KL University, Guntur, AP, India
Kalyan Kollana Bharat Department of Computer Science and Engineering,
Vignan’s Institute of Information Technology, Visakhapatnam, Andhra-Pradesh,
India
Karthik S. Department of CSE, Vignan’s Institute of Information Technology (A),
Duvvada, Visakhapatnam, India
Karthikeyan A. School of Electrical Engineering, Vellore Institute of Technology,
Vellore, India
Kumari Majji Prasanna Department of Computer Science and Engineering,
Vignan’s Institute of Information Technology, Visakhapatnam, Andhra-Pradesh,
India
Kumpatla Devi Shanthisree Department of Computer Science and Engineering,
Vignan’s Institute of Information Technology (A), Visakhapatnam, Andhra Pradesh,
India
Lohit Pyla Department of CSE, Vignan’s Institute of Information Technology (A),
Duvvada, Visakhapatnam, India
Mandhala Venkata Naresh Department of Computer Science Engineering,
Koneru Lakshmaiah Education Foundation, Vaddeswaram, Andhra Pradesh, India
Mantri Chris Daniel Department of Computer Science and Engineering, Vignan’s
Institute of Information Technology (A), Visakhapatnam, Andhra Pradesh, India
Marline Joys Kumari N. Department of Computer Science & Engineering, Anil
Neerukonda Institute of Technology and Sciences, Visakhapatnam, AP, India
Minda Aneel Kumar International SOS, Dubai, United Arab Emirates
Mohammad Ali Baig School of Electronics and Communication Engineering,
REVA University, Bengaluru, India
Mohammed Ismail B. Department of Artificial Intelligence & Machine Learning,
P.A. College of Engineering, Mangalore, Karnataka, India
Mohan Gonuguntla Krishna Department of Computer Science and Engineering,
Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, A.P, India
Mouli Kopanathi Department of Computer Science and Engineering, Koneru
Lakshmaiah Education Foundation, Guntur, India
Muqeet Mohd. Abdul Electrical Engineering Department, Muffakham Jah
College of Engineering and Technology, Hyderabad, India
Muteba Arcel Kalenga Department of Electrical and Electronics Engineering
Technology, University of Johannesburg, Johannesburg, South Africa
Muthumanickam K. Department of Information Technology, Kongunadu College
of Engineering and Technology (Autonomous), Thottiyam, Tiruchirappalli, India
Muzammil Parvez M. Electronics and Communication Engineering Department,
KLEF, Deemed to Be University, Vaddeswaram, A.P, India
NagaMallik Raj S. Department of Computer Science & Engineering, Vignan’s
Institute of Information Technology (A), Visakhapatnam, Andhra Pradesh, India
Nallapati Siva Rama Krishna Department of Computer Science Engineering,
Koneru Lakshmaiah Education Foundation, Vaddeswaram, Andhra Pradesh, India
Neal Joshua Eali Stephen Department of Computer Science and Engineering,
Vignan’s Institute of Information Technology, Visakhapatnam, Andhra Pradesh,
India
Neeraja S. Department of Computer Science & Software Engineering, Lendi Institute of Engineering and Technology, Vizianagaram, Andhra Pradesh, India
Nerli Rajendra B. Department of Urology, JN Medical College, KLE Academy of
Higher Education and Research (Deemed-to-Be-University), Belagavi, Karnataka,
India
Nikhil P. Department of CSE, Vignan’s Institute of Information Technology (A),
Duvvada, Visakhapatnam, India
Nithin Sai N. Department of Computer Science and Engineering, Koneru Laksh-
maiah Education Foundation, Vaddeswaram, India
Ogudo Kingsley A. Department of Electrical and Electronics Engineering Tech-
nology, University of Johannesburg, Johannesburg, South Africa
Palani Kumar R. Department of Information Technology, Kongunadu College of
Engineering and Technology (Autonomous), Thottiyam, Tiruchirappalli, India
Pandiaraja P. Department of Computer Science and Engineering, M.Kumarasamy
College of Engineering, Thalavapalayam, Karur, TamilNadu, India
Parshionikar Sangeeta Department of Computer Science and Engineering,
Koneru Lakshmaiah Education Foundation, KLEF, Guntur, Andhra Pradesh, India
Prabhakar G. Department of Computer Science and Engineering, Koneru Laksh-
maiah Education Foundation, Guntur, India
Praveena N. VR Siddhartha Engineering College, Kanuru, Vijayawada, AP, India
Qadeer Shaik Electrical Engineering Department, Muffakham Jah College of
Engineering and Technology, Hyderabad, India
Ramya Puppala Department of Computer Science and Engineering, Koneru
Lakshmaiah Education Foundation, Vaddeswaram, Guntur, India
Rao N. Thirupathi Department of Computer Science and Engineering, Vignan’s
Institute of Information Technology (A), Visakhapatnam, AP, India
Reddy B. Dinesh Department of Computer Science and Engineering, Vignan’s
Institute of Information Technology (A), Visakhapatnam, Andhra Pradesh, India
Sah Basant KL University, Guntur, AP, India
Sai Mokshith M. Department of Computer Science and Engineering, Koneru
Lakshmaiah Education Foundation, Vaddeswaram, India
Sai Basanaboyana Vamsi Department of Computer Science and Engineering,
Vignan’s Institute of Information Technology, Visakhapatnam, Andhra-Pradesh,
India
Sai D. Surya Department of Computer Science and Engineering, Vignan’s Institute
of Information Technology (A), Visakhapatnam, AP, India
Sandeep Nirujogi Venkata Sai Department of Computer Science & Engineering,
Vignan’s Institute of Information Technology (A), Visakhapatnam, AP, India
Satyanarayana K. V. Department of Computer Science and Engineering, Raghu
Engineering College, Visakhapatnam, Andhra Pradesh, India
Satyanarayana M. Department of Computer Science and Engineering, Raghu
Engineering College, Visakhapatnam, Andhra Pradesh, India
Seelam Chandana CSE, Koneru Lakshmaiah Education Foundation, Guntur,
Andhra Pradesh, India
Sen Apoorva Medi-Caps University, Indore, MP, India
Spandhana Mamidi Sai Sri Venkata Department of Computer Science & Engi-
neering, Vignan’s Institute of Information Technology (A), Visakhapatnam, AP,
India
Srinivas P. V. V. S. Department of Computer Science Engineering, Koneru Laksh-
maiah Education Foundation, Vaddeswaram, Andhra Pradesh, India
Srinu P. Department of CSE, Vignan’s Institute of Information Technology (A),
Duvvada, Visakhapatnam, India
Sumathi K. Department of Computer Science and Engineering, KSR Institute for
Engineering and Technology, Thiruchencode, TamilNadu, India
Swathi K. Department of Computer Science and Engineering, Vignan’s Institute of
Information Technology (A), Visakhapatnam, AP, India
Tadiparthi Eswar Abisheak Department of Computer Science and Engineering,
Vignan’s Institute of Information Technology, Visakhapatnam, Andhra-Pradesh,
India
Talluru Tejaswi CSE, Koneru Lakshmaiah Education Foundation, Guntur, Andhra
Pradesh, India
Teja Pisupati Krishna Department of Computer Science and Engineering, Koneru
Lakshmaiah Education Foundation, Guntur, India
Vamsi Krishna G. Department of Computer Science and Engineering, Dr. Lanka-
palli Bullayya College of Engineering, Visakhapatnam, AP, India
Vardhan K. Asish Department of Computer Science and Engineering, Bullayya
College of Engineering for Women, Visakhapatnam, AP, India
Vemuru Srikanth Department of Computer Science and Engineering, Koneru
Lakshmaiah Education Foundation, Vaddeswaram, Guntur District, A.P, India
Voddi Swathi Department of Computer Science and Engineering, Prasad V. Potluri
Siddhartha Institute of Technology, Vijayawada, Andhra Pradesh, India
A Graph-Based Model for Discovering
Host-Based Hook Attacks
P. Pandiaraja, K. Muthumanickam, and R. Palani Kumar
Abstract Though computer malicious software is referred to by different names such as virus, worm, Trojan, spam, and botnet, its ultimate goal is to cause damage to the end computer or end user. Progress in computer technology allows a malware writer to integrate obfuscation techniques to evade detection, specifically API hooking in Windows. Unfortunately, signature-based detection approaches such as anti-virus software on the end computer are not effective against system call reordering. To overcome this shortcoming, many different behavior-based approaches have been offered. However, these approaches bear limitations such as false positives, difficulty detecting zero-day attacks, and limited ability to improve the detection accuracy rate from past experience. In this article, an application programming interface (API)-based call graph model is put forward which captures API system calls during malicious rootkit execution on the Windows platform. As a graph model can be effectively applied to represent complicated relations between entities, we adopt it to visualize malicious rootkit behavior by monitoring system API calls. This will help the defender to optimally distinguish malicious system calls from benign calls. Our simulated experimental analysis proves that our method achieves a higher detection rate and accuracy with fewer false positives compared to existing techniques.
Keywords API hook · Graph · Malware attack · Rootkit
P. Pandiaraja (B)
Department of Computer Science and Engineering, M.Kumarasamy College of Engineering,
Karur, Tamil Nadu 639113, India
e-mail: sppandiaraja@gmail.com
K. Muthumanickam · R. Palani Kumar
Department of Information Technology, Kongunadu College of Engineering and Technology
(Autonomous), Thottiyam, Tiruchirappalli 621215, India
e-mail: muthumanickam@kongunadu.ac.in
R. Palani Kumar
e-mail: palanikumar@kongunadu.ac.in
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_1
1 Introduction
Today, hacktivists and cyber-criminals are able to write malware with advanced evading techniques and continue to evolve different techniques with the intent of assaulting end users' privacy. A new type of malware is launched every day by modifying its predecessor. The AV-Test report [1] detected more than 500 million malware samples in 2018. Analyzing such a huge number of malware samples manually is a tedious process, so we need an automated malware analysis technique to craft virus definitions. Today, malware developers often integrate a rootkit technique, which mainly uses API hooking, into malware software to avoid detection. A malware detector is a software program that can be operated locally on the victim computer to discover and locate malware. Usually, there are two different kinds of inputs given to a malware detector, namely the unique signature of the malware or its monitored behavior. After gaining these two inputs, it is easy for a malware detector to identify malicious programs.
Nowadays, a huge number of malicious samples are submitted frequently to security companies to analyze whether each sample is malware or legitimate. In order to expose hijacked API calls, we need a behavioral monitoring system which separates malicious activities from legitimate activities. Existing tools such as [2, 3] are useful for generating reports on unknown executables which affect Windows API calls. However, the generated reports need to be clustered manually based on similar behavior or on malicious activity. Analyzing a huge amount of malware-infected information to recognize its intended attack is typically a difficult issue. A few existing works rely on a signature-based approach.
The activities of a malware can be detected by collecting anomalous network traffic; for example, a botnet attack can be identified by collectively monitoring a network of computers and then looking for computers that exhibit similar communication patterns. Though network-based analysis approaches are useful, they suffer from several limitations. First, a malware packet may imitate a legitimate packet to avert detection. Secondly, if the payload of a malware is encrypted, then collecting and analyzing network traffic cannot reveal its presence. Thirdly, a network-based approach fails to sense malicious activities when they do not communicate with a remote attacker. In addition to the signature-based approach, another fitting place to supervise and investigate malware behavior is the end host. We can detect a malicious code attack even before it gets executed on the victim computer. However, current host-based malicious code detection techniques do not use effective models. As a result, these models cannot capture the central or essential properties of a malicious executable. An API call graph (ACG) is a candidate solution: a suitable representation of the data and control flow of software programs. Additionally, it offers information about the local data usage of a procedure and the global data that can be exchanged between different procedures. A call graph acts as a suitable tool either to study the behavior of a program or to track the flow of values between different components of a program. An ACG can also be used to recognize procedures that are never invoked. In this paper, we present an ACG framework for detecting malicious software that uses API hook attacks, based on the synthesis of static and dynamic analysis techniques. The contributions of this paper are as follows:
Static and dynamic analysis methods are used to identify and extract the API invocation calls of an executable and their associated parameters.
An API system-call dependency graph algorithm is devised to generate graphs from the extracted information.
Finally, an ACG matching algorithm is implemented to compare all the data-dependency graphs, identifying whether an API call made by the executable is legitimate or a malicious hook attack.
The arrangement of this article is structured as follows. Section 2 presents the existing techniques to detect malware attacks using a graph model, and Sect. 3 explains the proposed system to optimally detect malware attacks. The experimental environment and evaluation results are analyzed in Sect. 4. Section 5 concludes.
2 Related Works
Graphs can be used to reflect the execution flow of an executable file through nodes (vertices) and edges (links) that, respectively, denote API function calls and the relationships between them. Almost all recent malware is developed from a predecessor by incorporating new features. The operations invoked through system calls can be traced and modeled as a digraph composed of nodes and edges, where each node signifies a function call and each edge denotes a call between functions. Such a graph is referred to as a call graph.
Malicious rootkits that use the API hook technique continue to be an advancing hazard to current computing technology. With the ever-growing explosion of these kinds of threats, it is required to build up new methods to combat them. Though many antivirus programs are available to classify files as either malware or benign, they suffer from two limitations. First, they rely on a signature-based approach which cannot identify unknown malware signatures. Secondly, antivirus programs cannot deal with malware that uses the API hook technique. Many graph-based approaches have been proposed in the past to dynamically analyze malware attacks. The n-gram approach was one of the first methods to spot malware activities, especially identifying polymorphic and obfuscated viruses [6–8]. The uniqueness of our work is to identify API hook attacks in a novel way which utilizes a graph-based approach.
A graph is an attractive tool for analyzing malware attacks efficiently [8, 10]. In order to investigate malware-based attacks on the Internet, red teams manually generate graphs, but such work either produces false positives or is difficult to apply to malware that implements the API hook technique. So, researchers use different techniques, code graphs or call graphs, to build and analyze malware attacks [4]. Guo et al. [9] proposed a binary translation approach to analyze and detect malware execution. The authors generated a control flow graph based on the malware's behavior, and then another API subgraph was generated to compare its activities. The authors of paper [5] presented a graph-based malware inference model that relied on system call information which can be invoked at the time of execution on a victim computer. This method offered an improved detection rate and avoids the scalability issue. Many works published in the past stress the importance of applying machine learning and statistical methods to discover the presence of malware. Nath and Mehtre [11] propose a mixture of different data samples which can be created from malicious malware trials for detecting malware, like n-grams, instructions, and unique byte strings.
Bio-sequence-based comparison methods, which rely on genetic chains, also exist [12] for evaluating genetic trails to detect legitimate executables. Cuckoo Sandbox [13] is a most popular malware analysis tool. This open-source tool can be used to automatically analyze many different files like emails, executables, etc., and infer informative data. These data summarize the flow direction of the malware's execution and collect information about API function calls, registry files, and the flow of network traffic. Pirscoveanu et al. [14] utilized Cuckoo to achieve an improved classification rate. Elhadi et al. [15] developed an API call graph model using dependency relationships and profiles of function calls to discover malicious operations. This model uses the past history of known, discovered malware samples to identify unknown malware attacks. However, polymorphic packed malware would make detecting zero-day attacks very multifarious. Mehra et al. [16] proposed a combination of control flow graph (CFG), API call graph (ACG), and histogram techniques to classify a system as either benign or malicious. This method uses two different algorithms: one for removing unwanted data and manipulating a CFG, and another for generating an ACG and its features.
The modified longest common subsequence algorithm (m-LCSA) [16] is utilized to find the similarity linking two strings using the longest subsequences that are common to all input strings and to determine the best subgraph. Khodamoradi et al. [17] applied the decision tree method to infer statistical information on opcodes from disassembled code and then build threshold values. The opcode statistics extractor tool is used to examine disassembled code and calculate opcode frequency values, which are then considered to check whether malicious code is present. Mosli et al. [18] proposed a machine learning-based malware detection approach using a support vector machine. This method extracts different features like API function calls, registry accesses, and import/export library functions from malware-accessed memory areas.
An existing method [19] deployed hybrid solutions that apply various stemming techniques and algorithms to optimize detection accuracy. Kane et al. [20] proposed an optimized opcode method for discovering obfuscated malicious executables. In this work, first, a support vector machine technique was applied to categorize different types of files. Then, a histogram-based opcode density extraction procedure was exercised to create the opcode set during application execution. Salehi et al. [21] presented a study on generating important features from the argument return values of API function call lists. The experimental results indicate that this research work obtained a detection rate of 99.9% with negligible false positives. Techniques for comparing the nodes and structures of two different call graphs and their similarity level will be exercised to detect malware in this paper. We anticipate the system call traces of a function call to be very similar, with similar structures [23]. In addition, unrelated system calls should invoke some API function calls with dissimilar structures. A few existing research papers [24] also impose an authentication system during the validation of system calls of different applications.
3 Proposed Graph-Based Model
We assume that most malicious malware is developed by inheriting characteristics from its previous version. For example, the various versions of the TDSS rootkit are as follows. TDL1 was implemented to load and run at operating system boot time and was designed with the intention of infecting drivers. TDL2 appears to be the same as TDL1; however, it uses different names with random strings and also imports new techniques to avoid detection and removal. In order to obtain control over the victim computer, TDL3 patches the disk controller driver, and some features of TDL2 were updated to make detection and removal more difficult. The aim of the TDL4 variant is the same as that of TDL3; however, it patches the Master Boot Record, which makes infection of 64-bit systems also possible.
A call graph is a directed graph G = (V, E) in which V is a set of nodes, each representing a function of the executable program, and E ⊆ V × V is a set of ordered pairs of nodes [14]. A directed edge (u, v) ∈ E represents a function call of the program from u to v. The proposed idea attempts to optimize the accuracy of malicious-code API hook attack detection using an API call graph. An API call graph is constructed as a data-dependency graph in which each node represents an API call and each edge denotes the dependency between two calls.
The modified LCS graph matching algorithm is applied to identify common subgraphs and their similarity. The overall picture of our system is given in Fig. 1, which comprises two important stages, referred to as the preprocessing stage and the post-processing stage.
Fig. 1 Structure of the proposed system

3.1 Preliminary Processing Stage
A function can invoke other functions or be invoked to accomplish a certain task. An API-based call graph (ACG) is generated to show the relationship between callees and callers and acts as a vital source from which to extract important features. The important function types that can be exercised to generate the necessary resources of an ACG are classified as follows.
nlFun: API functions which do not reside in the system's dynamic link library (DLL) and can automatically generate function names.
lFun: API functions which reside in the system's DLL files.
imFun: API functions which are imported from the system's DLL files.
xFun: API functions which are not identified as library API functions but use a jump instruction to execute a detoured code indirectly.
There are a few malicious hooks which do not follow any of the four aforementioned API function call types, and discovering such hooks is out of the scope of this article. All the nodes and edges of an ACG are extracted from the aforementioned API functions. An edge with some cost for a pair of nodes can be produced using the two API-based calls associated with it. To construct an ACG, all the functions associated with an executable are identified by referring to the system tables, namely the Import Address Table and the Export Address Table. Then each function is verified to determine whether it is a system call. If it is, its function name and parameters are used to construct the call graph. An ACG is generated in which each node contains the function name and an edge is established using the parameter lists: if two parameter lists share a redundant parameter, this reflects a dependency between the recent and the preceding API-based call.
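To make this construction concrete, the following minimal sketch (our illustration in Python, not the authors' tool; the trace records and handle names are hypothetical) builds the adjacency structure of an ACG from an ordered API call trace, adding an edge whenever a call shares a parameter, such as a file handle, with a preceding call:

from collections import defaultdict

def build_acg(api_calls):
    # api_calls: ordered trace of (function_name, parameter_list) tuples.
    # Returns an adjacency dict mapping a node to its dependent successors.
    graph = defaultdict(set)
    seen = []
    for name, params in api_calls:
        graph.setdefault(name, set())  # keep isolated nodes in the graph
        for prev_name, prev_params in seen:
            # A shared parameter (e.g. a handle) marks a data dependency
            # between the preceding call and the recent call.
            if set(params) & set(prev_params):
                graph[prev_name].add(name)
        seen.append((name, params))
    return graph

trace = [("CreateFileW", ["h1"]),
         ("WriteFile", ["h1", "buf"]),
         ("CloseHandle", ["h1"])]
print(dict(build_acg(trace)))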
3.2 Malware Detection Stage
Today, a malware writer can develop a malware by adding new features and techniques to its predecessor rather than coding from scratch. This information helps us to reduce the complexity of considering all kinds of dependencies while querying the data graph (DG). The objective of the malware detection stage is to generate a subset of the DG by referring to the query graph (QG) and extracting a few important features of such graphs. The important predefined features that are used to detect a malware sample are given in Table 1.

Table 1 Important features mined from the ACG
Node: API function invoked through a system call
Edge: Relationship between two API function calls
Start node: The first node of the ACG
Isolated node: A function which does not call any other function
Subgraph: An undirected subgraph of the ACG
Type of a node: The function type of the node (nlFun, lFun, imFun, or xFun)
Definition 1 A subgraph Gx = (Vx, Ex), where Vx is a collection of nodes and Ex a collection of edges, contains both the start node and the last node recently visited, together with the edge between these two nodes. A subgraph does not contain another subgraph Gy = (Vy, Ey) with Vy ⊆ Vx.
Definition 2 A best subgraph includes the nodes of all the recently generated subgraphs.
The central idea of graph similarity is to generate a subgraph of the DG that best matches the QG. To apply m-LCSA, the data-dependent ACG is required to be transformed into a string sequence. The algorithm then maps a path of the QG against a path in the DG using m-LCSA. The pseudocode of m-LCSA is given in Algorithm 1.
Algorithm 1. Algorithm for matching call graph(s)
1. Input: Query Graph (QG) and Data Graph (DG)
2. Output: Similarity Matching
3. procedure SIMMATCHING(QG, DG)
4.   rval ← 0
5.   extract the paths P1 of QG and P2 of DG
6.   if (the paths P1 and P2 have the same label on every edge) then
7.     for (every path, find the similarity using LCSA) do
8.       rval ← rval + LCSA(P1, P2)
9.       node ← function_name
10.    end for
11.  end if
12.  p ← paths_in_QG
13.  r(QG, DG) ← rval / |p|
14.  if (rval(P1) == rval(P2)) then
15.    report 'Malicious API call'
16.  end if
17.  return rval
18. end procedure
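For reference, the per-path comparison in steps 7–8 is a standard longest common subsequence computation over sequences of API function names, and step 13 normalizes the accumulated score. A minimal Python sketch of that core (our illustration under those assumptions, not the authors' implementation):

def lcs_len(a, b):
    # Dynamic-programming length of the longest common subsequence
    # of two paths, each given as a list of API function names.
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a, 1):
        for j, y in enumerate(b, 1):
            dp[i][j] = dp[i-1][j-1] + 1 if x == y else max(dp[i-1][j], dp[i][j-1])
    return dp[len(a)][len(b)]

def path_similarity(qg_path, dg_path):
    # Normalize by the query path length so a full, in-order match
    # of the query path inside the data path scores 1.0.
    return lcs_len(qg_path, dg_path) / max(len(qg_path), 1)

q = ["CreateFileW", "WriteFile", "CloseHandle"]
d = ["CreateFileW", "ReadFile", "WriteFile", "CloseHandle"]
print(path_similarity(q, d))  # 1.0: every query call appears in order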
A malware detection scheme is proposed to discover unknown malicious executables using a two-stage procedure. First, the API function calls to be invoked are modeled as an ACG. Then, a few important features are extracted from the ACG which can be used for training the proposed scheme. Finally, the presence of a malware sample is discovered using the features extracted from the ACG.
4 Experimental Results and Discussion
We focus on detecting malware that executes as PE executables on the Windows platform.
As no standard yardstick exists for comparing two graphs to detect a malware
attack, many researchers use their own malware datasets with various assessment
techniques. We collected a malware sample dataset that contains 250
worms, 250 viruses, 250 Trojans, and 250 benign legitimate programs that use an
API hook technique. Benign programs were collected from computers running
fresh copies of Windows 7 and Windows XP. We ran each malware sample
in an isolated environment to identify and extract API calls and their parameters using
an API monitoring tool. The API calls of an executable can be identified by analyzing
binary files statically using a tool like IDA Pro [18] or by executing the binary
dynamically in an isolated environment using a tool like API Monitor [19]. Although
API-based calls can be analyzed through dynamic investigation, the malicious binary
must be executed several times to cover its various execution flows.
To dynamically analyze a malicious executable file, the following three operations
are performed. First, the obfuscation cover is removed. Second, unpacking
and decryption are performed on the executable. Finally, functions are extracted
and assigned unique symbolic names. Using this extracted information,
a graph is generated for each API call. We utilize different techniques, such as
random forest and data mining classification, to produce appropriate classifiers.
To verify the usefulness of our method in detecting the presence of
malware, cross-validation over different malware samples is applied. The
test dataset is partitioned into ten sets, each consisting on average of 75
malware samples and 25 benign programs. The proposed framework is then trained
on nine sets, and the remaining set is used for testing.
To utilize call graphs to precisely locate API hook attacks, it is necessary
to compare a call graph that reflects API hook behavior against those that reflect
benign behavior. To compare two call graphs, we used the graph matching Algorithm
1 to determine similarity by matching data graphs with query graphs. When two
graphs have the same number of nodes, the match is said to be exact. All experiments
were run on a machine running the Windows 7 operating system. For every system call,
its equivalent DG is generated and then compared with the QG. By analyzing numerous
known malware attacks, we set a similarity threshold of 95% to determine whether
a generated graph impersonates a malicious operation. If the computed
similarity of a sample surpasses the predefined threshold, it is flagged as
malware that uses an API hook attack.
4.1 Discussion
Table 2 lists the overall detection accuracy rates of the different graph-based approaches
considered for malware detection. The random forest graph-based approach attained
a worm detection accuracy of 97.1%, only 0.4% lower than its Trojan detection rate
of 97.5%. Although worms and Trojans use different kinds of attacking
strategies, the detection accuracy rates are similar. We came to the same
conclusion from the outcome of the next classifier, data mining. The proposed
method, however, attained approximately 99% detection accuracy and outperforms
the other methods. All the methods listed in Table 2 also obtained nearly the same
detection accuracy rates when a different dataset was used, which confirms
the consistency of the proposed model.
The accuracy of our method is appraised using parameters such as false positives
(FP), which occur when the test labels a legitimate program as malicious,
detection rate (DR), and accuracy rate (AR). The percentage of benign programs
classified as malicious is measured as the false positive rate (FPR), determined as
FPR = FP / (FP + TN). Figure 2 shows the ROC curves of all
techniques taken for analysis and comparison, and Table 3
gives the AUC values of each technique.
The small twist in the data mining curve reveals that data mining-based malware
detection suffers from more false positives. As the AUC values of random forest
and data mining are almost the same, their ROC curves almost overlap. The proposed
method achieves a better AUC value, about 0.99 for all classes of
malware samples, than the other two techniques, with minimal false positives. The
same training and testing datasets are employed in the bigrams and graph edit distance-
based approach [22], and its comparison with our approach is presented in Table
2. To test the robustness of the proposed scheme, a small subset (1%) of the dataset
was randomly chosen for training, leaving the remaining 99% to be discovered.
Figure 3 demonstrates the performance of the proposed malware detection approach;
its detection accuracy is compared with the bigrams and graph edit distance-based
approach.
It can be inferred that the detection accuracy of the proposed method reaches
98% when the training set is 9% of the entire dataset, and 99% is reached
consistently once the training set grows beyond 10%. The malware detection rate of the
bigrams and GED-based method falls below 95% when the training set is 1%,
and it attains an overall malware detection rate of 97%. There are two reasons for the variation
in detection accuracy: first, different malware datasets are used for training and testing;
and second, the bigrams and GED-based method depends on the
features of known malware samples.

Table 2 Detection accuracy rate (%) of different graph-based approaches
Approach                   Worm   Virus   Trojan
Random forest              97.1   97.3    97.5
Data mining                96.1   96.6    95.5
Proposed approach          98.9   98.7    98.8
Graph edit distance (GED)  97.6   96.7    96.2

Fig. 2 ROC curves of random forest, data mining, and proposed test results

Table 3 AUC values of random forest, data mining, and proposed test results
Approach           Worm    Virus   Trojan
Random forest      0.984   0.985   0.987
Data mining        0.987   0.981   0.982
Proposed approach  0.992   0.993   0.994
Fig. 3 Performance comparison of the proposed method
4.2 Limitations
The proposed API call graph hook detection approach assumes that graphs of
malware samples from the same family yield a high computed similarity value.
However, more advanced kernel-level malware uses effective obfuscation techniques
to evade detection, which can affect the overall effectiveness of the proposed approach.
Next, polymorphic malware with advanced packing poses a serious challenge when
executing the sample and extracting all of its associated parameters. Finally, some
malware can mimic the names of various operating system resources; as a result,
computing the similarity value between two apparently legitimate operating system
resources is a challenging task. We identify this issue as a possible research direction.
5 Conclusions
Today, most malware authors integrate API-based hooking methods to avoid
detection by antivirus measures. This article presents a method that uses graphs
to discover API hook-based attacks, based on mistrustful system call traces and the
relationships among them. System calls are represented as a call graph, graph
comparison is applied, and the system finally computes a similarity value to determine
the presence of malware. The experimental evaluation over the test malware samples
shows that our method achieves an average detection rate of 99%, exceeding the
existing schemes, and it also achieves better space complexity. In future, we plan to
incorporate machine learning techniques to automatically predict any attack
that targets and exploits system resources.
References
1. Current malware statistics. https://www.av-test.org/fileadmin/pdf/security_report/AV-TEST_Security_Report_2018-2019.pdf. Accessed 11 Dec 2018
2. Bayer U, MilaniComparetti P, Hlauschek C, Kruegel C, Kirda E (2009) Scalable, behavior-
based malware clustering. In: Proceedings of the NDSS, pp 8–11
3. Willems C, Holz T, Freiling F (2007) Toward automated dynamic malware analysis using
CWSandbox. Secur Priv 2:32–39
4. Muthumanickam K, Ilavarasan E, Dwivedi SK (2013) A dynamic botnet detection model based
on behavior analysis. Int J Recent Trends Eng Technol 1:104–111
5. Muthumanickam K, Ilavarasan E (2014) Enhancing malware detection accuracy through graph
based model. Br J Math Comput Sci 4(15):2237–2250
6. Muthumanickam K, Ilavarasan E (2015) An effective method for protecting native API hook
attacks in user-mode. Res J Appl Sci Eng Technol 9(1):33–39
7. Muthumanickam K, Ilavarasan E (2015) COPDA: concealed process and service discovery
algorithm to reveal rootkit footprints. Malays J Comput Sci 28(1):1–15
8. Swiler LP, Phillips C, Ellis D, Chakerian S (2001) Computer-attack graph generation tool. In:
Proceedings of the DARPA information survivability conference, pp 307–321
9. Guo H, Pang J, Zhang Y, Yue F, Zhao R (2010) HERO: a novel malware detection framework
based on binary translation. In: Proceedings of the IEEE international conference ICIS, pp
411–415
10. Muthumanickam K, Ilavarasan E (2012) Automatic generation of P2P botnet network attack
graph. In: Das VV (ed) Proceedings of the third international conference on advances in
information on technology and engineering. Springer, New York, pp 288–293
11. Nath HV, Mehtre BM (2014) Static malware analysis using machine learning methods. In:
Martínez Pérez G, Thampi SM, Ko R, Shu L (eds) Proceedings of recent trends in computer
networks and distributed systems security: second international conference, SNDS 2014,
Trivandrum, India. Springer, Berlin, 13–14 Mar 2014, pp 440–450
12. Oehmen CS, Peterson ES, Phillips AR, Curtis DS (2013) A biosequence-based approach to
software characterization. In: Proceedings of the IEEE international conference on intelligence
and security informatics, pp 330–332
13. Automated malware analysis—Cuckoo Sandbox. http://www.cuckoosandbox.org/. Accessed
2018/12/10
14. Pirscoveanu RS, Hansen SS, Larsen TMT, Stevanovic M, Pedersen JM, Czech A (2015)
Analysis of malware behavior: type classification using machine learning. In: Proceedings
of the international conference on cyber situational awareness, data analytics and assessment
(CyberSA), pp 1–7
15. Elhadi AAE, Maarof MA, Barry BIA, Hamza H (2014) Enhancing the detection of metamorphic
malware using call graphs. Comput Secur 46:62–78
16. Mehra V, Jain V, Uppal D (2015) DaCoMM: detection and classification of metamorphic
malware. In: Proceedings of the fifth international conference on communication systems and
network technologies, pp 668–673
17. Khodamoradi P, Fazlali M, Mardukhi F, Nosrati M (2016) Heuristic metamorphic malware
detection based on statistics of assembly instructions using classification algorithm. In:
Proceedings of the 18th CSI international symposium on computer architecture and digital
systems (CADS)
18. Mosli R, Li R, Yuan B, Pan Y (2016) Automated malware detection using artifacts in forensic
memory images. In: Proceedings of the IEEE symposium on technologies for homeland security
(HST), pp 1–6
19. Alabbas W, Al-Khateeb HM, Mansour A (2016) Arabic text classification methods: systematic
literature review of primary studies. In: Proceedings of the 4th IEEE international colloquium
on information science and technology (CiSt), Tangier, pp 361–367
20. O’kane P, Sezer S, McLaughlin K (2016) Detecting obfuscated malware using reduced Opcode
set and optimised runtime trace. Secur Inform 5(2):2–10
21. Salehi Z, Sami A, Ghiasi M (2017) MAAR: robust features to detect malicious activity based
on API calls, their arguments and return values. Int J Eng Appl Artif Intell 59(1):95–98
22. Bolton AD, Anderson-Cook CM (2017) APT malware static trace analysis through bigrams
and graph edit distance. Stat Anal Data Min 10(3):182–193
23. Muthumanickam K, Ilavarasan E (2015) Optimization of rootkit revealing system resources—a
game theoretic approach. J King Saud Univ Comput Inf Sci 27(4):386–392
24. Krishnan M, Egambaram L (2020) PAM: process authentication mechanism for protecting
system services against malicious code attacks. Sådhanå 45(141):1–12
E-Health Care Patient Information
Retrieval and Monitoring System Using
SVM
K. Sumathi and P. Pandiaraja
Abstract In health care, modern technologies and smart devices have brought
excellent results. In the Intensive Care Unit, these technologies provide more facilities
for taking care of patient health. The Internet of Things lets such gadgets connect
to the Internet, providing a link between the caretaker and the patient that
enables duplex communication. The aim of the patient surveilling device is to
protect the patient in the intensive care unit. The system analyzes the
patient's essential movements and continuously sends reports to the doctor through
the cloud. With the help of a support vector machine algorithm, the sensed data
are compared with the available dataset. If a compared value rises above or falls
below its threshold, a precaution message is sent to the
server. The server sends a notification message to the caretaker and provides
guidance for giving first aid. To collect this vital information, sensors are needed.
These sensors measure body parameters such as blood pressure, temperature
(body heat), heart rate, and sugar level. In addition, our system also analyzes
coma patient movement with the help of a three-axis accelerometer sensor. The
heart rate is measured using a pulse oximeter sensor, and blood pressure is monitored
by a blood pressure sensor; these sensors generate reports frequently. The
system helps the patient by sending messages not only in emergencies
but also as precaution messages when a value rises above or falls below its
threshold. The surveilling system assists the caretaker, reduces work pressure,
and also mitigates the nursing staff shortage problem. The biomedical
data of the patient are sent through the server over a wireless communication
network, and the data are displayed on a mobile phone as well as a laptop using a
K. Sumathi
Department of Computer Science and Engineering, KSR Institute for Engineering and
Technology, Thiruchencode, TamilNadu, India
e-mail: thirusumathi83@gmail.com
P. Pandiaraja (B)
Department of Computer Science and Engineering, M.Kumarasamy College of Engineering,
Thalavapalayam, Karur 639113, TamilNadu, India
e-mail: sppandiaraja@gmail.com
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_2
web browser. This multitasking implementation in our system helps health care,
especially in the Intensive Care Unit.
Keywords Microcontroller · Internet of things · Sensor network · Three-axis accelerometer
1 Introduction
The patient surveilling device is used for observing the patient's vital signs. In
day-to-day life, people suffer from many diseases, and hospitals often do not have
sufficient nursing staff to monitor patients, especially in the intensive care unit.
The Internet of Things is a growing technology that helps people in the medical field
by digitalizing nursing care through our device [1]. The device is connected to
sensors, and if any symptoms are observed by a sensor, it passes a message to the
doctor with the help of the server. The system is not made for any particular age
group; it can be used by anyone admitted to the intensive care unit [2]. Wireless
sensor networks are used to transfer data from transmitter to receiver wirelessly.
The main objective of the system is to act as an intermediary when the doctor is
not available in the hospital, so that he or she can still monitor patients by
receiving their details through this system [3, 4].

An Arduino microcontroller is used, to which the sensors are connected.
The Arduino board continuously reads input from the various sensors and uses
a cloud database to store the data; the sensed information is transmitted to the
cloud with the help of the Arduino [5, 6]. GSM technology is used for location
tracking, through which the analyzed report of the patient can be sent to the doctor [7].
E-medicine plays a vital role in the intensive care unit for fast recording of patient
details. The support vector machine algorithm is used for analysis, where
the data sensed by the sensors are compared with ideal data. When a problem is
identified, a message is automatically sent to the consulting physician with the help
of the server, which can save the patient's life [8]. Display devices are used in many
fields, but they are particularly useful in medicine, where the display must be
very accurate for predicting disease [9]. Generally, LCD displays are used.
Image resolution is very important: high-resolution information
is needed for efficient further treatment and diagnostics. An LCD display is composed
of a constant number of pixels, which helps display the information on the screen
[10], and it yields good quality images. In today's scenario, most health
care units are built with audible alarms. The alarm is a spontaneous warning device
that helps hospitals alert staff and convey messages quickly and effectively;
alarms in intensive care units are triggered by many devices.
2 Related Works
The patient surveilling device collects data from sick people in the
intensive care unit with the help of the Internet of Things. The information is sensed
through various sensors. Generally, the patient surveilling device helps reduce the
work of caretakers and also saves patients' lives [11]. The system works with an
IoT device such as an Arduino board or Raspberry Pi, which handles various
analog and digital signals [12]. Each sensor is connected to the Arduino board
to send the sensed information to the concerned person. The sensed information is
analyzed for decision making and for predicting diseases. The system is
classified into three stages. In the first stage, biosensors are used to predict disease
[13].

Sensors such as the heart rate sensor, temperature sensor, and blood pressure sensor
are used to analyze the patient's daily status and send the sensed information to the
IoT device, such as an Arduino or Raspberry Pi [14]. In this technical world, everything
is smart, and smart devices like the Arduino or Raspberry Pi are
interconnected with objects in the environment. In the medical field, the Internet
of Things plays a major role and is very helpful for monitoring and tracking. In the second
stage, the sensed information is transmitted through the server [15, 16]. To operate the
device, a Wi-Fi connection is needed: the Arduino board must connect to the Wi-Fi
network using a Wi-Fi module; only then can the information reach the
doctor or the caretaker [17]. Once the information arrives, it helps the caretaker
or doctor with further treatment. In the final stage, the system triggers an alarm in an
emergency, so that the caretaker is alerted and passes the information to the
doctor. In this system, the main component is the microcontroller board [18, 19].

The Arduino board consists of a microchip and analog and digital pins. It also has
a USB port that allows connection to a laptop or PC, a transceiver and receiver, and
some light-emitting diodes. The Arduino draws power automatically from the USB
port or an external power supply. In this system, a support vector machine algorithm
is applied for data classification [20]. The patient-trained dataset is stored in the
network, where the database is updated. The support vector machine algorithm checks
whether a value is above or below its threshold, and in an emergency it sends a
message to the caretaker [21]. The messages then pass to a mobile phone or a display
device such as a monitor or PC. The display device mainly uses liquid crystal displays;
generally, the LCD is used for high-resolution image processing [22]. If the image
resolution is high, the doctor can predict the disease very easily and diagnosis
proceeds efficiently. Medical parameter values such as heart rate, blood
pressure, and temperature are sent to the caretaker and doctor in digital form [23,
24], so that they can analyze these values and monitor the patient more precisely,
as in Fig. 1.
Fig. 1 Existing system for patient health monitoring
3 Proposed Model in IoT
3.1 Microcontroller
The ESP8266 is often called a Wi-Fi module, but it is actually a microcontroller.
It can be used in two ways: through AT commands, which send data from an
Arduino to the ESP, or through the Arduino IDE. The ESP8266 module has eight
pins used to perform various functions, and its maximum input voltage is 3.3 V;
a higher input voltage can damage the module. NodeMCU is an
open-source Lua-based firmware, and the NodeMCU V3 development board
runs on the ESP8266. NodeMCU provides 4 MB of flash
memory and 50 KB of usable RAM. The board has 30 pins, 15 on the left and
15 on the right, of which 16 are general-purpose input/output pins; of these,
10 are used for digital input and output, and 1 pin serves as the analog input,
as in Fig. 2.
Fig. 2 Node MCU with ESP8266
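The section above describes the Lua-based NodeMCU firmware and the Arduino IDE route. Purely as an illustration, the same ESP8266 hardware can also be programmed in Python if MicroPython firmware is flashed instead; this is an assumption of the sketch below, not the paper's setup, and the SSID and password are placeholders.

```python
# MicroPython sketch for an ESP8266/NodeMCU board (assumes MicroPython
# firmware is flashed; SSID and password are placeholders).
import network
from machine import ADC

def connect_wifi(ssid, password):
    sta = network.WLAN(network.STA_IF)   # station (client) interface
    sta.active(True)
    if not sta.isconnected():
        sta.connect(ssid, password)
        while not sta.isconnected():
            pass                          # busy-wait until the access point accepts us
    return sta.ifconfig()[0]              # the board's IP address

ip = connect_wifi("icu-ward-ap", "secret")
adc = ADC(0)                              # the single analog input pin (A0)
print("online at", ip, "raw sensor value:", adc.read())
```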
3.2 Sensors
A sensor converts impulses such as light, heat, sound, and motion into electrical
signals. The sensed information is gathered and sent to an interface, which
converts it into a binary code; this binary information is then sent to the computer
for further processing. Two types of sensors are used here: a blood pressure sensor
and a temperature sensor.
3.3 Blood Pressure Sensor
Blood pressure is the pressure exerted by circulating blood on the blood vessels,
expressed as the ratio of systolic to diastolic pressure. Blood pressure is
traditionally measured with a sphygmomanometer, but the blood pressure sensor
measures the artery without using mercury, employing a non-invasive method.
The normal blood pressure range is about 120/80 mmHg; a reading above
180/120 mmHg means the person is in serious condition.
3.4 Temperature Sensor
There are several types of temperature sensors: thermocouples, resistance
temperature detectors, thermistors, infrared sensors, and semiconductor sensors.
Thermocouples and resistance temperature detectors are used to monitor human
body temperature; the normal body temperature for a person is about 37 °C.
3.5 Three-Axis Accelerometer Sensor
This three-axis accelerometer is used to monitor coma patients. The sensor
measures the acceleration of the body and compares the result with that of a normal person.
3.6 Display Device
The display device collects the signals and shows them on a monitor or
screen. In general, LCD displays are used in medical fields because they produce
good image resolution, which makes prediction easier for the doctor.
3.7 Alarm System
The alarm system is used to alert people. Alarm systems are common in
industry for alerting workers in an emergency, and nowadays they are used
in many organizations, including hospitals, to protect people's lives. In the medical
field, the alarm saves the patient's life by alerting the caretaker. In hospitals,
a mild alert sound is typically used so as not to disturb the other patients.
3.8 SVM Algorithm
In this system, the support vector machine algorithm is used. SVM is a supervised
learning algorithm that separates data with the help of a hyperplane, mapping
data objects into a multi-dimensional space. SVMs are of two types: a linear SVM
draws a straight separating line to distinguish two classes, while a nonlinear SVM
requires an additional dimension to identify classes that cannot be separated in two
dimensions, as in Fig. 3.
Fig. 3 Support Vector Machine classifier algorithm
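The paper does not give code for this step; the scikit-learn sketch below illustrates how an SVM of this kind could be trained on vital-sign vectors. The readings, labels, and the linear kernel are synthetic assumptions for illustration only.

```python
# Minimal scikit-learn sketch of the classification step: vital-sign vectors
# (temperature degC, heart rate bpm, systolic BP mmHg) labelled normal (0) or
# critical (1). All readings below are synthetic.
from sklearn.svm import SVC
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline

X = [[36.8, 72, 118], [37.0, 80, 122], [36.5, 65, 115],   # normal
     [39.5, 120, 171], [40.1, 130, 180], [35.0, 40, 85]]  # critical
y = [0, 0, 0, 1, 1, 1]

# Standardize features so no single vital sign dominates the margin.
model = make_pipeline(StandardScaler(), SVC(kernel="linear"))
model.fit(X, y)

print(model.predict([[38.9, 115, 165]]))  # -> [1], would trigger an alert
```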
3.9 Cloud Server
A cloud server is a physical or virtual server, running on a cloud computing
platform, on which data are collected, stored, and hosted through the Internet.
The cloud keeps computer data and digital data in large logical pools of cloud
storage. It is safe to store data in the cloud, where they can easily be retrieved
or accessed anywhere and at any time through the Internet, and they are not
easily lost. In our surveilling device, the sensed information is stored in the
cloud; the collected data are processed on the cloud computing platform, which
sends notifications or other messages to the particular doctor or nursing staff.
The sensed information can be viewed through a web portal address.
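As a sketch of how a sensed reading could be pushed to such a cloud endpoint, the snippet below uses HTTP; the URL and JSON field names are hypothetical, since the paper does not specify the web API.

```python
# Sketch of uploading one sensed reading to a cloud endpoint. The URL and the
# JSON field names are hypothetical placeholders.
import requests

reading = {"patient_id": "ICU-07", "temp_c": 37.2, "heart_rate": 78, "bp_sys": 120}
resp = requests.post("https://example-health-cloud/api/readings",
                     json=reading, timeout=5)
resp.raise_for_status()  # surface transport errors instead of silently dropping data
```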
4 Proposed Work
Our proposed system protects the patient in the intensive care unit. It helps
doctors and nursing staff reduce their stress level and also protects the
patient's life. The surveilling device collects the patient's blood pressure level, pulse
rate, and body temperature with the help of sensors. The sensors are connected to
the NodeMCU Wi-Fi module, through which the collected data are compared with
the dataset using the support vector machine algorithm. The collected information
is compared with the available data, and messages are displayed with the help of
the cloud server. When a compared value is above or below the threshold, a
notification is passed to the doctors and nursing staff. A conventional system only
passes information in an emergency; in our system, we fix the thresholds and pass
messages both in emergencies and as precautions. In addition, our system monitors
coma patients with the help of a sensor. The compared data are passed to the
cloud, which lets the doctor access the report from any location; this makes it easy
for doctors to diagnose the patient's condition without visiting the health
care center, as in Fig. 4.
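The threshold logic described above can be sketched as follows: values beyond a limit raise an emergency, while values close to a limit raise a precaution. The limits and the margin are illustrative assumptions, not clinical values from the paper.

```python
# Sketch of the dual-message threshold logic: precaution near a limit,
# emergency beyond it. LIMITS and the 5% margin are illustrative.
LIMITS = {"temp_c": (35.0, 38.5), "bp_sys": (90, 160), "heart_rate": (50, 110)}

def classify(reading, margin=0.05):
    alerts = []
    for key, (lo, hi) in LIMITS.items():
        value, span = reading[key], hi - lo
        if value < lo or value > hi:
            alerts.append((key, "EMERGENCY"))
        elif value < lo + margin * span or value > hi - margin * span:
            alerts.append((key, "PRECAUTION"))
    return alerts

print(classify({"temp_c": 38.4, "bp_sys": 171, "heart_rate": 78}))
# -> [('temp_c', 'PRECAUTION'), ('bp_sys', 'EMERGENCY')]
```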
5 Results and Discussion
A modular system is used to analyze the patient's vital signs, and the support
vector machine algorithm works well to provide the results. Body parameters are
analyzed using various sensors connected to the cloud, so that patients in the
intensive care unit are diagnosed continuously, with three-axis accelerometer
sensors monitoring coma patient movement. The use case diagram of our proposed
work is divided into three modules: analyzing body parameters through sensors,
diagnosing the patient in the intensive care unit, and sending notifications through
the cloud, as in Fig. 5.
Fig. 4 Proposed system for patient health monitoring
Different wearable sensors are used to measure the coma patient's body parameters,
such as body temperature, muscle activity, pulse rate, and blood glucose level.
These tiny sensors are in direct contact with the patient's skin and can be used to
detect conditions such as fever, abnormal blood pressure, and abnormal sugar level.
The physiological parameters collected from these sensors are preferred by
doctors due to their accuracy.
Small hardware preprocesses the acquired data and transmits the desired
result to other devices through communication software. The sensors are
small, lightweight, and do not hinder the mobility and movement of the patients.
Energy-efficient components operate the sensors, so these components
can work continuously without charging or replacement. The accurately
and securely recorded information of the coma patient at any location is reported to the
doctors using the data transmission system in the communication software.
The result of this module is represented in Fig. 6.
Fig. 5 Use case diagram for proposed model
Fig. 6 Analyzing body parameters through sensors
5.1 Diagnosing Patient in Intensive Care Unit
Remote monitoring targets several sub-groups of patients, such as patients
diagnosed with chronic illnesses, patients with mobility issues or other disabilities,
post-surgery patients, neonates, and elderly patients. Automated health care services
are essential for our society and reduce the burden on nursing staff. The transparency
of this system increases the trust level of the patients. During emergency
conditions, the buzzer and LED (Fig. 7b) in the alarm system alert the doctors,
who can act quickly and handle the situation easily. The general steps in
diagnosing a patient in the ICU are represented in Fig. 7a.
5.2 Sending Notification Through Cloud
The real-world application challenges are addressed using a proxy-based approach
for end-to-end communication between IoT-enabled living systems. It is a challenge
for large organizations to find cloud monitoring solutions [21–24] that provide
support in identifying emerging defects and troubleshooting them before they turn
into major issues. A sink node collects the signals from the sensors and forwards the
information to the cloud via Wi-Fi or Bluetooth. The data stored in the cloud are further
processed whenever necessary. After processing the data, if an emergency is found,
a notification is sent to the doctor using a cloud-enabled smartphone, as
depicted in Fig. 8.
Fig. 7 a Steps in diagnosing patient in ICU, b diagnosing patient using LED display
5.3 Support Vector Machine
Support vector machine or SVM is one of the most popular supervised learning
algorithms, which is used for classification as well as regression problems. The goal
of the SVM algorithm is to create the best line or decision boundary that can segregate
n-dimensional space into classes so that we can easily put the new data point in the
correct category in the future.
This best decision boundary is called a hyperplane. SVM chooses the extreme
points/vectors that help in creating the hyperplane; these extreme cases are called
support vectors, and hence the algorithm is termed a support vector machine. SVM
works by mapping data to a high-dimensional feature space so that data points can
be categorized even when the data are not otherwise linearly separable. A separator
between the categories is found, and the data are then transformed in such a way that
the separator can be drawn as a hyperplane, as in Fig. 9.
Fig. 8 Sending notification through cloud (a health-monitor log listing, for each
record, the sensed temperature (T), heart beat (HB), and blood pressure (BP)
together with a date and time stamp)
The comparison of naïve Bayes, decision tree, Zero R, and support vector machine
for true and false classification is given in Table 1, and their performance is
represented in Fig. 10.
Accuracy and precision of the different classification algorithms are calculated
using the following formulas:

Accuracy = (TP + TN) / (TP + TN + FP + FN) × 100%

Precision = TP / (TP + FP) × 100%

The comparison of the accuracy and precision of the various approaches (naïve
Bayes, decision tree, Zero R, and support vector machine) is given in Table 2,
and the results are represented in Fig. 11.
Fig. 9 Support Vector Machine decision boundary algorithm
Table 1 Comparison of true and false classification approaches (coma patient data)
Approach       True classification (%)   False classification (%)
Naïve Bayes    84.13                     15.87
Decision tree  67.89                     32.11
Zero R         97.87                     2.13
SVM            86.87                     13.13
Fig. 10 Comparison of true and false classification model (percentage of instances per algorithm)
Table 2 Comparison of accuracy and precision of various approaches
Approach       TP rate   FP rate   Accuracy (%)   Precision (%)
Naïve Bayes    0.833     0.339     92.411         71.075
Decision tree  0.962     0.019     98.137         98.063
Zero R         0.992     0.012     99.602         98.805
SVM            0.868     0.142     93.866         85.941
Fig. 11 Comparison of accuracy and precision across the algorithms
6 Conclusion
This system helps ill people, and it also helps doctors detect the patient's physiological
signs, providing the best reports and reducing their workload. Our system
overcomes the disadvantages of the existing system. The support vector machine
algorithm is used to analyze the patient's vital signs, and it works well to provide
the results. The main ideas of our system are to use a cloud server to pass the data
and a three-axis accelerometer sensor to monitor coma patient movement. The system
provides efficient, good health services to the patients, and its key feature is
the ability to examine the patient from anywhere at any time. Our system uses
modern technologies and various sensors, and it is easy to use.
References
1. Jaiswal S, Katake R, Kute B, Ranjane S, Mehetre PD (2016) Survey of health monitoring
management using internet of things (IoT). Int J Sci Res 5(11):2243–2246
2. Sara GS, Sridharan D (2014) Routing in mobile wireless sensor network: a survey. Telecommun
Syst 57(1):51–79
3. Pustiek M, Beristain A, Kos A (2016) Challenges in wearable devices based pervasive wellbeing
monitoring. Int Conf Ident Inf Knowl Internet of Things, IIKI:236–243
4. Anusha N, Deepthi SM, Madhu M, Manish Adithya HM, Amutharaj J (2019) Smart healthcare
system assisted by IOT and emergency app. Int Res J Eng Technol 6(5):4551–4554
5. Kajaree D, Behera R (2017) A survey on healthcare monitoring system using body sensor
network. Int J Innov Res Comput Commun Eng 5(2):1302–1309
6. Meghana K, SudhirBabu A, KoteshwaraRao K, SreeLakshmi D (2019) Iot based patient health
monitoring system. J Inf Comput Sci 9(8):639–647
7. Pandiaraja P, Deepa N (2019) A novel data privacy-preserving protocol for multi-data users by
using genetic algorithm. J Soft Comput 23(18):8539–8553
8. Deepa N, Pandiaraja P (2019) Hybrid context aware recommendation system for E-health care
by merkle hash tree from cloud using evolutionary algorithm. J Soft Comput 24(10):7149–7161
9. Zhang Y, Liu H, Su X, Jiang P, Wei D (2015) Remote mobile health monitoring system based
on smart phone and browser/server structure. J Healthc Eng 6(4):717–738
10. Saranya M, Preethi R, Rupasri M, Veena S (2018) A survey on health monitoring system by
using IoT. IJRASET 6(3):778–782
11. Kishore KH, Nath KS, Krishna KV, Kumar DP, Manikanta V, Basha FN (2019) IOT based
smart health monitoring alert device. IJITEE 8(6S):157–160
12. Vijayakumar P, Pandiaraja P, Balamurugan B, Karuppiah M (2019) A novel performance
enhancing task scheduling algorithm for cloud based e-health environment. Int J E-Health
Med Commun 10(2):102–117
13. Pandiaraja P, Deepa N (2020) E health care data privacy preserving efficient file retrieval from
the cloud service provider using attribute based file encryption. J Ambient Intell Humanized
Comput 12(5):4877–4887
14. Cirani S, Picone M (2015) Wearable computing for the internet of things. IEEE IT Prof
17(5):35–41
15. Pandiaraja P, Vijayakumar P (2017) Efficient multi-keyword search over encrypted data in
untrusted cloud environment. In: Proceedings of the 2nd international conference on recent
trends and challenges in computational models (ICRTCCM ’17), pp 251–256
16. Mahalakshmi S, Latha R (2019) Artificial intelligence with the internet of things on healthcare
systems: a survey. IJATCSE 8(6):2847–2854
17. Shankar A, Pandiaraja P, Sumathi K, Stephan T, Sharma P (2020) Privacy preserving E-voting
cloud system based on ID based encryption. J Peer-to-Peer Networking Appl. https://doi.org/
10.1007/s12083-020-00977-4
18. Sumathi K, Pandiaraja P (2020) Dynamic alternate buffer switching and congestion control in
wireless multimedia sensor networks. J Peer-to-Peer Networking Appl 13(6):2001–2010
19. Sravanan S, Abiramai T, Pandiaraja P (2018) Improve efficient keywords searching data
retrieval process in cloud server. In: International conference on intelligent computing and
communication for smart world (I2C2SW), pp 219–223
20. Rajesh Kanna P, Pandiaraja P (2019) An efficient sentiment analysis approach for product
review using turney algorithm. J Procedia Comput Sci 165:356–362
21. Hemasri M, Sumathi K (2019) SLA based combinatorial resource allocation model in cloud
computing. Int J Adv Sci Technol 29(7S):1151–1159
22. Sumathi K, JoseTriny K (2019) Secured data outsourcing in cloud with ECC encryption. Int J
Innov Technol Exploring Eng 8(8):1223–1227
23. Sumathi K, Naveena K, Prashanth P, Revanthkumar S, Srikeerthanaa AP (2019) E-Health based
patient surveilling device. Int J Emerg Trends Eng Res 8(3):792–796
24. Sumathi K, Adchaya P, Jayasri M, Nandhini B, PavithraJT (2019) Smart irrigation and agri-
culture monitoring system using cloud server based on IoT. Int J Adv Trends Comput Sci Eng
9(2):1082–1086
Number Plate Recognition Using Optical Character Recognition (OCR)
and Connected Component Analysis (CCA)
Puppala Ramya , Tummala Haswanth Chowdary, Pisupati Krishna Teja,
and Tadepally Hrushikesh
Abstract The number of automobiles has expanded dramatically during the last few
decades, and as a result, tracking them has become extremely difficult. In the event of
a traffic ticket or excessive speeding, identifying the automobile owner has become
nearly impossible. Image processing is used to identify car license plates to make this
practicable: license plates are retrieved from captured photographs using
perception and computer vision algorithms, and optical character recognition (OCR)
is then used to read the license number. We employ cameras to record high-speed
photos of number plates for image recognition, and image processing algorithms to
identify and validate the sequence of characters, as well as to convert the number
plate image to text. We use number plate recognition (NPR) to detect license plates.
NPR is a computer vision technique that enables equipment to scan license plates
on automobiles swiftly and automatically without the need for human intervention.
Image processing methods include hidden Markov models, linear filtering, neural
networks, and others. Our goal is to recognize license plates so that automobiles can
readily be followed in the event of a traffic penalty or excessive speeding.
Keywords Vehicle number plate ·Number plate recognition (NPR) ·Character
segmentation ·Recognized characters
1 Introduction
In recent years, number plate recognition or license plate recognition has proven to be
one of the most effective methods for vehicle surveillance. It can be used in a variety of
public locations for a variety of reasons, including traffic safety enforcement, car park
systems, and automatic vehicle parking systems. The four steps of an NPR algorithm
are as follows: (1) image capture of a vehicle, (2) identification of the license plate, (3)
P. Ramya (B) · T. H. Chowdary · P. K. Teja · T. Hrushikesh
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
Guntur, India
e-mail: mothy274@kluniversity.in
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_3
character segmentation, and (4) recognition of characters. In this project, we create
software for a real-time license plate recognition system. Using OpenCV and
optical character recognition, the system recognizes and reads car license plates
automatically: it detects the license plate using OpenCV's contour function, and
the plate numbers are then read using optical character recognition. Connected
component analysis (CCA) is the method used to segment the images. Connected
regions mean that all of the pixels in a region are part of the same object;
two pixels with the same value are said to be connected when they are adjacent
to each other. A license plate picture recognized in a car image is the first output
obtained after running the software; this is used as input for the next step, and CCA
is used to bind the characters in the plate using this image.
As seen in Fig. 1, the first task may appear simple, but it can be difficult to capture
a moving car in real time while making sure that all of its components, particularly
the license plate, are visible. Many algorithms today can recognize license plate
numbers in less than 50 ms, so assessing an NPR system's effectiveness is quite
important. Along with a visual and NPR quality assessment, a thorough study of
license plate recognition (LPR) is offered. The terms "number plate" and "license
plate" are used interchangeably in this literature. Each NPR step is discussed at
length in Sect. 2.
Fig. 1 Steps of number plate recognition model
1.1 The Purpose of This Paper
Since many publications combine several techniques, the works built on the methods
depicted in Fig. 1 are examined and categorized according to the methodology used
in each approach. A survey of commercial products is not within the scope of this
study, since such products frequently promise greater accuracy than is actually
achieved for promotional purposes. The remainder of this paper is organized as
follows: Sect. 2 presents an overview of several number plate detection techniques;
Sect. 3 discusses character segmentation techniques; and Sect. 4 discusses character
recognition techniques (Figs. 2 and 3).
2 Detection of License Plates
Number plate recognition algorithms can be classified into more than one category
based on the methodology used. When identifying a vehicle number plate, the
following factors must be taken into consideration: (1) the plate size can vary within
a car image; (2) the plate may be located anywhere on the vehicle; (3) the plate
background colour may vary with the type of vehicle, for instance, the background
of a government vehicle's number plate may differ from that of other public cars;
and (4) a screw on the plate may be mistaken for a character. The picture
segmentation approach can be used to extract a number plate. Diverse image
segmentation methods exist in the literature, and colour segmentation is used in
some plate segmentation methods.
The following sections outline popular number plate extraction methods, followed
by a detailed examination of the picture segmentation techniques used in various
NPR or LPR publications (Figs. 4 and 5).
Fig. 2 Hardware setup for NPR system
Fig. 3 Flowchart of the system
2.1 Binarization of Images
The conversion of a picture to black and white is known as image binarization. This
approach uses a threshold to categorize pixels as black or white. The key issue,
however, is determining an appropriate threshold value for each image; choosing
an optimal threshold can be challenging, if not impossible, at times (Figs. 6,
7, 8, 9 and 10).
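A short OpenCV sketch of both a fixed threshold and Otsu's automatic threshold selection, which addresses the threshold-choice problem noted above; the file name is a placeholder.

```python
# Binarizing a plate image with OpenCV; the input file name is illustrative.
import cv2

gray = cv2.imread("plate.jpg", cv2.IMREAD_GRAYSCALE)
# Fixed threshold: every pixel above 127 becomes white, the rest black.
_, fixed = cv2.threshold(gray, 127, 255, cv2.THRESH_BINARY)
# Otsu's method picks the threshold automatically from the image histogram.
t, otsu = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
print("Otsu chose threshold", t)
```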
2.2 Detecting the Edges
Edge detection is a fundamental method for feature detection or extraction.
In most cases, executing an edge detection technique yields object boundaries
with connected curves. Applying this method to complex photos is quite
difficult, because it can lead to object boundaries with disconnected curves. Canny,
Canny-Deriche, Differential, Sobel, Prewitt, and Roberts cross are some of the
edge detection algorithms and operators that are employed.
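A minimal OpenCV sketch of Canny edge detection as mentioned above; the hysteresis thresholds are common starting values, not tuned constants from the paper.

```python
# Canny edge detection; blurring first suppresses noise. File name and
# thresholds (100, 200) are illustrative.
import cv2

gray = cv2.imread("plate.jpg", cv2.IMREAD_GRAYSCALE)
blurred = cv2.GaussianBlur(gray, (5, 5), 0)  # smooth before differentiation
edges = cv2.Canny(blurred, 100, 200)         # hysteresis thresholds
cv2.imwrite("plate_edges.jpg", edges)
```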
Fig. 4 Images taken using a USB Camera
Fig. 5 Number plate extraction using smearing algorithm
2.3 Connected Component Analysis (CCA)
Blob extraction, also known as CCA, is a method for uniquely labeling subsets of
connected components using a heuristic. It scans a binary image and groups
pixels based on their connectivity conditions: in 8-connectivity, a pixel is compared
with its north-east, north, north-west, and west neighbours, while in 4-connectivity
only the north and west neighbours of the current pixel are examined. The approach
is efficient and can be used to perform automated picture analysis. This technique
can be utilized for both plate and character segmentation.
Fig. 6 Binary image
Fig. 7 Inverted binary image
Fig. 8 Line separation using row segmentation
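A minimal OpenCV sketch of CCA-based candidate filtering; the size and aspect-ratio bounds are illustrative assumptions, not values from the paper.

```python
# Connected component analysis on a binary plate image, keeping blobs whose
# size and aspect ratio look character-like. Bounds are illustrative.
import cv2

binary = cv2.imread("plate_binary.jpg", cv2.IMREAD_GRAYSCALE)
n, labels, stats, _ = cv2.connectedComponentsWithStats(binary, connectivity=8)
for i in range(1, n):  # label 0 is the background
    x, y, w, h, area = stats[i]
    if area > 100 and 0.2 < w / h < 1.0:  # plausible character proportions
        print(f"candidate character blob at ({x},{y}) size {w}x{h}")
```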
Fig. 9 Character separation using column segmentation
Fig. 10 Recognize character using OCR
2.4 Mathematical Morphology
Mathematical morphology draws on set theory, lattice theory, topology, and random
functions. It is most typically applied to digital images, although it can also be
applied to other spatial structures. It was originally designed to process binary images
and was later extended to handle grayscale functions and images. Erosion, dilation,
opening, and closing are some of the basic operators (Table 1).
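A brief OpenCV sketch of opening and closing; the 3x3 rectangular structuring element is an illustrative choice.

```python
# Morphological opening (erosion then dilation) and closing (dilation then
# erosion). Kernel shape/size and file name are illustrative.
import cv2

binary = cv2.imread("plate_binary.jpg", cv2.IMREAD_GRAYSCALE)
kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (3, 3))
opened = cv2.morphologyEx(binary, cv2.MORPH_OPEN, kernel)   # removes small specks
closed = cv2.morphologyEx(binary, cv2.MORPH_CLOSE, kernel)  # fills small holes
```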
2.5 Related Work in the Number Plate Detection
Plate detection procedures such as those described in the preceding sections are
prevalent, and further plate detection methods have been discussed in the literature.
A category-by-category analysis is impossible, because most of the strategies
presented in the literature use multiple approaches. The sections that follow discuss
the various number plate segmentation algorithms. The sliding concentric window
(SCW) technique aims to identify the region of interest (ROI) more quickly: from
the upper left corner of the image, two concentric windows move in two steps. To
adapt to camera distance and brightness under varied circumstances, the authors
developed a revolutionary technique.
Table 1 Some basic set operations
S. No   Set operations
1       Empty (null) set: ∅
2       Subset of sets A and B: A ⊆ B
3       Union of sets A and B: A ∪ B = {x | x ∈ A or x ∈ B}
4       Intersection of sets A and B: A ∩ B = {x | x ∈ A and x ∈ B}
5       Disjoint/mutually exclusive sets A and B: A ∩ B = ∅
6       Complement of a set A (with respect to a defined universe): Ac = {x | x ∉ A}
7       Difference of sets A and B: A\B = A − B = {x | x ∈ A and x ∉ B}
8       Reflection (transposition) of a set A: Â; Â = A if A is symmetric
9       Translation of a set A by a vector z = (z1, z2): Az = {x | x = a + z, a ∈ A}
Fig. 11 a A number plate with non-standard stylish font, b number plate with distorted angle, c
number plate with distorted angle
Finding contours and connected components, selecting a rectangular region based
on size and aspect ratio, initial learning for adaptive camera distance/height,
histogram-based localization, gradient processing, and a nearest-mean classifier
are some of the phases in the license plate detection method. Once these steps are
complete, the final detection result is sent for tracking (Figs. 11, 12 and 13).
3 Character Segmentation
After the number plate is located, its characters are checked in the next step.
Character segmentation can be done using a variety of approaches, just like plate
segmentation, and it is impossible to discuss the approaches by category because
many fall into more than one. This section discusses related work in this field,
followed by a discussion. Some of the approaches outlined in Sect. 2, such as image
binarization and CCA, can also be used for character segmentation.
Fig. 12 Blurry number plate
Fig. 13 Number plates detected and recognized
For character segmentation, H. Erdinc Kocer used contrast extension, median
filtering, and blob coloring techniques. Contrast extension is applied to make the
image sharper; histogram equalization, according to Kocer, is a popular approach
for improving the appearance of a low-contrast photograph. Unwanted noisy regions
are removed using median filtering. To detect closed and contact-less zones, the
blob coloring approach is applied to a binary image: by following connections in
four directions from a zero-valued background, this scanning procedure discovers
the independent areas (Fig. 14).
Fig. 14 Character segmentation
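The row/column segmentation shown in Figs. 8 and 9 can be illustrated with a short projection-profile sketch; this is a generic illustration, not Kocer's blob-coloring implementation, and the file name is a placeholder.

```python
# Projection-based character segmentation: columns whose ink count drops to
# zero separate adjacent characters. Assumes white characters on black.
import cv2

binary = cv2.imread("plate_binary.jpg", cv2.IMREAD_GRAYSCALE)
col_ink = (binary > 0).sum(axis=0)  # vertical projection profile
spans, start = [], None
for x, ink in enumerate(col_ink):
    if ink > 0 and start is None:
        start = x                    # a character run begins
    elif ink == 0 and start is not None:
        spans.append((start, x))     # the run ends at an empty column
        start = None
if start is not None:
    spans.append((start, len(col_ink)))
print("character column spans:", spans)
```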
4 Character Recognition
Character recognition, covered in more detail in Sect. 2, aids the identification
and creation of editable text from visual text. The bulk of number plate
identification algorithms employ a single character recognition approach, and this
section goes into detail on each strategy. Several algorithms use an optical character
recognition (OCR) tool to recognize characters, and a wide range of software is
available for OCR processing. Tesseract, a Google-maintained open-source OCR
tool with multilingual support, is commonly used to identify characters; one author
tweaked it to reach a character recognition rate of 98.7%. In the Markov random
field (MRF) model of character extraction, randomization is utilized to model the
uncertainty in pixel assignment, and character extraction is posed as an optimization
problem that maximizes the a posteriori probability based on prior knowledge
(Figs. 15 and 16).
Fig. 15 Recognition of characters
Fig. 16 Extraction of recognized characters from the image
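As an illustration of the OCR step, the snippet below reads a cropped plate with Tesseract through the pytesseract wrapper; it assumes the Tesseract binary is installed and is not necessarily the configuration used by the authors.

```python
# Reading a segmented plate with Tesseract via pytesseract. --psm 7 treats
# the input as a single text line, which suits a cropped plate image.
import cv2
import pytesseract

plate = cv2.imread("plate_binary.jpg", cv2.IMREAD_GRAYSCALE)
text = pytesseract.image_to_string(plate, config="--psm 7")
print("recognised plate:", text.strip())
```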
5 Conclusion
As each step depends on the previous phase, NPR is clearly a complex method, and
achieving 100% overall accuracy is currently impossible for the same reason.
Different lighting conditions, car shadows, and non-uniform license plate character
sizes, as well as font and background color, all affect NPR's performance. Some
systems are designed to operate only under these limited circumstances and may not
be accurate enough in other situations. Some systems have been created for and
deployed in a particular country, and only a small number of NPR systems have been
built for India; as a result, developing such a system for a country like India has great
potential. This paper presents a thorough examination of recent advancements and
potential trends in NPR, which will be useful to scholars working on similar projects.
In this project, we conclude that everyone must follow the government's rules,
which require the use of a Government of India number plate. Nowadays, many
people use fancy number plates, which violates the government's rules. Our idea is
that fancy license plates are scanned by the camera and a challan is generated for the
offending person. Once this is put into action, everyone will keep their government
license plates and follow the rules of the government.
Cartoonify an Image with OpenCV Using
Python
Puppala Ramya , Penki Ganesh, Kopanathi Mouli,
and Vutla Naga Sai Akhil
Abstract This article describes a technique for generating cartoon-like images from
digital pictures. The method used today differs from how things were done in
the past. This study focuses on the various tactics used during the process that, when
applied layer by layer, produce a well-balanced result. We investigate how
to combine several functions in a specific way to produce a filtered, composite
outcome. The mathematical foundations and mechanisms of the various functions are
also discussed. This article provides examples of a variety of cartooning techniques.
Any of the methods given here can be used to turn any type of photograph
into a cartoon, including pictures of people, mountains, trees, flora and fauna, etc.
Keywords Use filters to cartoonize images in Python · Including the bilateral ·
Gaussian · Pencil edge · Pencil sketch · Laplacian · Median filters · Computer
vision
1 Introduction
Cartoons are pictures of fictional or real-people-based figures. These days, semi-realistic
or non-realistic drawings that satirically or humorously reflect a situation or an event
are popular. One of the earliest instances of traditionally animated movies is
Fantasmagorie (1908), in which every frame was drawn by hand. This tradition of
hand-drawn animation frames is still prevalent today. Walt Disney caught up to the
competition with their excellent animated series and raised animated cartoons to a new
level. Cartoonists used to hand-draw these cartoons, but as “Anime” gained popularity,
it became more challenging for them to do so because it took a lot of time and a mistake
could not be undone. With the development of technology, a wide range of software for
digitally designing pictures was created, reducing the need for human labor and
speeding up the process.
P. Ramya (B) · P. Ganesh · K. Mouli · V. N. S. Akhil
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
Guntur, India
e-mail: mothy274@kluniversity.in
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_4
Compared with hand-drawn images, digital design is more efficient for artists, and it
improved over time as more features were added. Toy Story, the first fully
computer-animated feature film, was a huge success in 1995. It featured interactively
stunning characters, and the incredible animation brought them to life.
2 Objectives
2.1 Filter by Median
The median filter is a nonlinear filtering method for reducing noise while increasing
the accuracy of edge identification in a picture. It takes the image’s edges into account
when removing noise. Extreme outliers that would distort the average are the best
candidates for median filtering, which is why the filter is frequently used to eliminate
salt-and-pepper noise (Fig. 1). A minimal usage sketch follows.
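A minimal sketch using OpenCV’s cv2.medianBlur (the input path example.jpg is a placeholder, not from the paper):

```python
import cv2

# Load an input photograph (placeholder path) and apply a median filter.
# The kernel size (here 7) must be an odd integer; larger kernels remove
# more salt-and-pepper noise but also smooth away more fine detail.
img = cv2.imread("example.jpg")
denoised = cv2.medianBlur(img, 7)
cv2.imwrite("median_filtered.jpg", denoised)
```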
2.2 Filter Laplacian
A Laplacian filter is an edge detector that computes an image’s second derivatives by
measuring the rate at which the first derivative changes. This establishes whether
a change in the values of neighbouring pixels is the result of an edge or of a
continuous progression. The Laplacian approach treats a linear differential problem
using a second-order derivative. A minimal usage sketch follows.
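A minimal sketch with cv2.Laplacian (placeholder file name; the pre-blur step is a common-practice assumption, since a second derivative amplifies noise):

```python
import cv2

# Read as grayscale, suppress noise, then apply the Laplacian operator.
gray = cv2.imread("example.jpg", cv2.IMREAD_GRAYSCALE)
blurred = cv2.GaussianBlur(gray, (3, 3), 0)
edges = cv2.Laplacian(blurred, cv2.CV_8U, ksize=5)
cv2.imwrite("laplacian_edges.jpg", edges)
```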
3 Implementation
3.1 Step by Step Implementation
The main function is the entry point of the application and contains a sidebar, which
can be built using Streamlit’s sidebar select-box function. (1) The sidebar has some
values, each of which is associated with a function. (2) When a user clicks
Fig. 1 Original, pencil sketch, pencil edge images
on one of them, the associated procedure is launched. By default, the Pencil Sketch
option is chosen, and the “if” condition calls the PencilSketch() function, with st.title
used to create a bold title. Additionally, we can use st.image to display any image in
our Streamlit app.
After that, the section with the image-editing tools appears. These include the bilateral
filter, detail enhancement, and pencil sketch. The st.slider method in Streamlit is
used to generate an interactive slider. A widget that lets users choose their own
photos from their local system is added using the st.file_uploader() function; users can
choose an image by browsing or by dragging and dropping it into the boxed area around
the button. Text, such as messages or other important information, can be added to the
app using the st.write() function. Once the user has selected an image, it is displayed
by calling the st.image() function with the necessary inputs (Figs. 2, 3 and 4). A minimal
sketch of this interface follows.
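A minimal sketch of the described interface, assuming the paper’s PencilSketch() helper; a simple dodge-blend stand-in is used here since the original implementation is not shown:

```python
import cv2
import numpy as np
import streamlit as st

def pencil_sketch(img, blur_ksize=25):
    # Stand-in for the paper's PencilSketch() helper: grayscale,
    # Gaussian blur, then a "dodge" division to get a sketch look.
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    blurred = cv2.GaussianBlur(gray, (blur_ksize, blur_ksize), 0)
    return cv2.divide(gray, blurred, scale=256)

st.title("Cartoonify an Image with OpenCV")
choice = st.sidebar.selectbox(
    "Filter", ["Pencil Sketch", "Bilateral", "Detail Enhancement"]
)
ksize = st.slider("Blur kernel size", 3, 51, 25, step=2)  # odd values only
uploaded = st.file_uploader("Choose an image", type=["jpg", "jpeg", "png"])

if uploaded is not None:
    # Decode the uploaded bytes into an OpenCV BGR image.
    data = np.frombuffer(uploaded.read(), dtype=np.uint8)
    img = cv2.imdecode(data, cv2.IMREAD_COLOR)
    st.write("Filtered result:")
    if choice == "Pencil Sketch":
        st.image(pencil_sketch(img, ksize))  # single-channel sketch
    elif choice == "Bilateral":
        st.image(cv2.bilateralFilter(img, 9, 75, 75), channels="BGR")
    else:
        st.image(cv2.detailEnhance(img, sigma_s=10, sigma_r=0.15), channels="BGR")
```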
Fig. 2 Bilateral
Fig. 3 Detail enhancement
Fig. 4 Original image
3.2 Statement of the Problem
How a photograph is cartoonized depends on the strategy used by the algorithm; there
are numerous ways to carry out the same task. The most widely utilized techniques
are generative adversarial networks (GANs), a machine learning framework that
creates new data based on training data, and the OpenCV library, which we use in
our system. (1) The existing system is based on the filters applied to the
input image and the OpenCV library. To construct a custom filter, a variety of filters
can be combined or used separately. Among the most famous and well-liked filters
are medianBlur(), GaussianBlur(), Laplacian(), bilateralFilter(), and many others.
The best results are obtained when these filters are combined, rather than by
utilizing a single, minimally capable filter.
3.3 Advanced Technical Approach
Filtering an image is a critical step in image processing. Among other things, it can
be used to eliminate blur, reduce noise, and detect edges. Both linear and non-linear
algorithms are used for filtering, and the appropriate filter should be chosen for each
particular objective. If the input image has a large magnitude and little to no noise, a
non-linear filter is used; if the input image is low-magnitude and noisy, a linear filter
is typically used. Due to their simplicity and speed, linear filters are the most popular.
Linear filters use the Gaussian and Laplacian algorithms, while nonlinear filters use
the median and bilateral methods. The algorithms used here include pencil sketch,
detail enhancement, pencil edge, and bilateral, as in the short comparison below.
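A short comparison of one linear and one non-linear filter in OpenCV (illustrative parameters and a placeholder file name, not the paper’s settings):

```python
import cv2

img = cv2.imread("example.jpg")

# Linear filtering: Gaussian blur (fast, but softens edges).
linear = cv2.GaussianBlur(img, (5, 5), 0)

# Non-linear filtering: bilateral filter (slower, edge-preserving).
nonlinear = cv2.bilateralFilter(img, 9, 75, 75)
```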
The same Streamlit controls described in Sect. 3.1 (slider, file uploader, write, and
image display) are reused here (Fig. 5).
In this work, we propose an online image-cartoonization application that transforms
real-world photographs into attractive cartoon-style images. Because the animation
industry is here to stay and its requirements grow by the day, this technique allows
features to be added and is easily convertible to any other source code needed for
larger modules. The image will be less pixelated as a result of
Fig. 5 Pencil sketch image
Fig. 6 Process diagram
the automation system patching up our technology. Four filters have been applied
(see the sketch below): (1) Pencil sketch—creates a pencil sketch from the contents
of an image. (2) Bilateral filter—smooths the image by reducing noise while preserving
the edges. (3) Detail enhancement—improves the details by sharpening the image.
(4) Pencil edge—converts the image into one with only the most important edges,
with the insides filled white. Overall, the system is successful because it produces
satisfactory results on a large number of photographs, and support will increase as
we continue to improve the system to produce the best results.
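A hedged sketch of the four filters using OpenCV; the parameter values are illustrative assumptions, not the paper’s exact settings:

```python
import cv2

img = cv2.imread("example.jpg")  # placeholder input path

# (1) Pencil sketch: returns a grayscale and a colour sketch.
sketch_gray, sketch_color = cv2.pencilSketch(
    img, sigma_s=60, sigma_r=0.07, shade_factor=0.05
)

# (2) Bilateral filter: smooths flat regions while preserving edges.
smooth = cv2.bilateralFilter(img, d=9, sigmaColor=75, sigmaSpace=75)

# (3) Detail enhancement: sharpens fine detail.
enhanced = cv2.detailEnhance(img, sigma_s=10, sigma_r=0.15)

# (4) Pencil edge: keep only the strongest edges, white elsewhere.
gray = cv2.medianBlur(cv2.cvtColor(img, cv2.COLOR_BGR2GRAY), 5)
edges = cv2.adaptiveThreshold(
    gray, 255, cv2.ADAPTIVE_THRESH_MEAN_C, cv2.THRESH_BINARY, 9, 2
)

for name, out in [("sketch", sketch_gray), ("bilateral", smooth),
                  ("detail", enhanced), ("edges", edges)]:
    cv2.imwrite(name + ".jpg", out)
```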
There are many techniques for creating flat-shaded, cartoonish images. In the pencil
sketch, we applied the filter with default sigma values and the default Gaussian blur
technique with a 25 × 25 kernel to blur our image. By increasing the filter size, which
also lessens image noise, we can create finer lines for our sketch. Laplacian filter
kernels frequently have negative values centered inside the array in a cross pattern;
the corners hold either 0 or 1, and the centre value may be either positive or negative.
A 3 × 3 kernel for a Laplacian filter is, for example:

 0 −1  0
−1  4 −1
 0 −1  0

The first test makes use of a landscape photograph. Although there are numerous
objects in the picture, the arrangement of the buildings, the shape of the horizon, and
the leading lines give the picture a very appealing and lively appearance. Since the
filters work well with concrete objects, the technique produces results that are
considerably cleaner and more “cartoonified” when used with cityscape images
(Fig. 6).
4 Proposed System
The goal of this project is to create an intuitive application that enables users to apply
cartoon filters to whatever photos they want. On a wide range of photographs, the
filters are designed to produce beautiful and amusing outcomes. The code was
developed in Python; therefore, the system emphasizes the program’s simplicity.
Python is regarded as the most “fun” and easy-to-learn language, with a wide range
of applications in every field. Anyone of any age may use the interface, regardless
of system or service provider, and it can be accessed from any device with a browser
and an internet connection once it is hosted online. There are some considerations
when using the detail enhancement filter: for a shot to be considered ideal, the subject
must be distinguishable from the background, and the lighting must be well
distributed and not overdone. The slider should be used carefully, because even the
smallest adjustments can have a significant impact on the result. The program
accurately records even the smallest detail on the edges, and the filter allows the
edge-detection power to be adjusted. Bear in mind, however, that the higher the
power, the more unnecessary edges will show when adjusting the filter.
4.1 Challenges and Problem
Network training for different image types is time-consuming and computationally
intensive (it requires GPUs). Style variations may occur depending on the image’s
content: the type of content image completely determines how accurate the
cartoon-like appearance will be (Figs. 7, 8 and 9).
5 Conclusion
As a consequence, we were able to show how a cartoon may be created from an
image, and examples of how an image is turned into a cartoon are offered. The
hardware and software requirements for converting images to cartoons are also given
in this document. A diagram illustrates the methodical process of converting images
to cartoons, along with the related equations and algorithm. Additionally, we have
listed a few of the difficulties and problems that might come up when cartoonizing a
captured image. In this study, we also looked at the significance and degree of
cartoonizing the content image (Fig. 10).
Fig. 7 Pencil edge, detail enhancement, bilateral images
Fig. 8 Original image
Fig. 9 Bilateral image
Fig. 10 Image deenhancement
Summary
We used computer vision algorithms to turn a typical image into a comic in the preceding
demonstration. We are going to have a lot of fun with computer vision techniques. The
cartoonify-image function is called after we check which key was pressed on the
keyboard. The sketch mode attribute has distinct values in the two calls, resulting in two
different outputs (we described what the output will look like earlier in this chapter).
Web Design as an Important Factor
in the Success of a Website
Puppala Ramya , K. Jai Sai Chaitanya, S. K. Fardeen, and G. Prabhakar
Abstract The internet has grown into a new business medium with e-commerce in
recent years, wherein good web design plays an important role. Hence, a detailed
study has been taken up to understand the characteristic features of good web
design for successful e-commerce websites (Journal of Systems and Information
Technology, Volume 11, Issue 2 (2009-05-03)). It has been identified that, along with
the outer appearance of a website, focus should be placed on its usability by all kinds
of users, including the visually challenged. Based on the case studies conducted, this
article provides a few tips to make websites convenient for people with low or no sight,
using tools like screen readers and voice synthesizers. However, no single optimum
design can be specified, as the goods sold differ, as do the consumers across geographic
locations. Applying accessibility and usability standards from the beginning of the
design phase is far less expensive than integrating them later (Maria Claudia Buzzi,
Marina Buzzi, Barbara Leporini, Chap. 4, Accessibility and usability of web content
and applications, IGI Global, 2010). In the long term, making things more accessible
and inclusive is a win–win situation and will increase any individual’s and
organization’s overall efficiency and effectiveness of engagement.
Keywords Web design · e-commerce · Visually challenged · Consumer happiness
1 Introduction
The Internet has grown rapidly in popularity as a new business medium in recent
years. Throughout the world, there are more than 60–70 million websites and a target
audience of nearly 1400 million, providing practically limitless opportunities in the
market. As a consequence, there has been a major increase in competition, and
organizations are questioning how to get the best results. Understanding what people
want might be the first step towards finding a solution. As a result, a considerable body
P. Ramya (B) · K. Jai Sai Chaitanya · S. K. Fardeen · G. Prabhakar
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
Guntur, India
e-mail: mothy274@kluniversity.in
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_5
of research has evolved concentrating on the factors that impact an e-commerce
website’s performance from the perspective of consumers. A good web design is
an important factor in the success of e-commerce organizations in the present
competitive world (Figs. 1 and 2).
This study focuses on a broad examination of views within the marketing profes-
sion. The implementation of successful interfaces that produced favorable responses
from users has sparked these lines of study. In order to obtain good website
satisfaction or to increase consumers’ online purchase intentions, good web design is
essential. A successful web design includes the following contents:
Fig. 1 Website contents
Fig. 2 Website offers
Despite the identification of the importance of web design in the creation of
successful virtual stores, there appears to be a surprising lack of content in the
literature on how to handle the many parts of website design. Hence, in this paper,
we would like to answer the following research questions:
RQ1: From a consumer’s perspective, what are the most important aspects that influence the success of e-commerce websites? What function does site design play in this?
RQ2: What are the primary characteristics of effective virtual shop web design practices?
2 Affective Factors for a Successful Website of e-commerce
In recent times, many studies have attempted to determine which aspects contribute
to a website’s success. In this regard, marketing literature has emphasized the
consumers’ perspective in defining how a successful e-commerce website should
be. As Carlos Flavian et al. [1] quoted in their article “A successful website attracts
customers, makes them believe the site is trustworthy, dependable, and reliable, and
creates customer pleasure”.
Following this approach, some authors have emphasized customers’ thoughts and
perceptions of the value provided by website qualities, while others have highlighted
the primary elements of website quality from a consumer’s perspective. Furthermore,
various study lines have been produced with the goal of highlighting those decisive
aspects and underlining the significance of obtaining online consumer happiness and
its influence on purchase intention. In this regard, it is worthwhile to mention the
factors that might impact consumers’ opinions about buying on the internet and
their intentions to do so. Consumers’ attitudes and purchase intentions are influenced
by elements such as product perceptions, shopping convenience, appropriate
information, appealing looks, price details, etc.
To analyze the performance of websites, we concentrated on content and design
to satisfy consumer perceptions of the structure, image and style of websites, along
with transaction information and navigation. The findings revealed how important it
is to include user perceptions when studying websites. However, external factors
such as the source, the industry, and the scale of the website also play a key role in
success.
According to the findings, the most important characteristics of e-commerce
websites for enhancing online shoppers’ purchasing intentions are security and the
protection of personal data, the quality of the visual design, and the provision of
relevant navigation to meet their needs (Fig. 3).
As a result, the significance of showing high-quality information, strong content
and an effective and appealing navigation system becomes clear. These are the most
important advantages of using an e-commerce website.
Fig. 3 Chart of e-commerce website
3 Importance of Web Design in e-commerce
A successful web design includes a coherent and useful form structure planned to
meet the purpose of the consumer’s search. It should be designed in an artistic way
to attract the various perspectives of consumers, to excite their emotions, and to
increase their online purchasing intentions.
While designing a website, we should keep in mind the various aspects of its
usability, such as easy navigation, main operations that are easy to memorize, a
simple look, the avoidance of mistakes in operation, enhanced pleasure, and, finally,
consumer satisfaction.
Usability may also be thought of as a technique for assessing the quality of a
website in this vein. As a result, a system’s simplicity of use might lead to more
concept learning and a better capacity to predict how that system performs. Usability,
in particular, enhances the consumer’s grasp of the content and activities that must be
understood in order to attain a goal (e.g., to place an order). This lowers the chances
of making a mistake and raises trust levels (Figs. 4 and 5).
In terms of website design elements, a good web design must give not only
beauty and attractiveness, but also high levels of usefulness, as it impacts the user’s
emotional states. As a result, a well-designed website should have excellent usability.
The usage of a website can be made more pleasurable by a pleasing design. In reality,
Fig. 4 Examples of successful website
Fig. 5 Visual of website
a high level of perceived usability may lead to better satisfaction, trust, and loyalty to
a website. The verified measurements of a website’s usability and design include
identifying characteristics such as response time (download delay), content
arrangement (navigation), and the website’s information and content. Media-richness
elements, such as a website’s ability to change its appearance and contents
(interactivity) and the availability of feedback between the seller and the user, were
also identified as drivers of a website’s success in the study.
The study also found that the aesthetics of a website’s design had a significant
impact on its success. The impact of the web environment on consumers was studied,
and it was revealed that the environment’s insights influence the consumer’s cognitive
and affective states, as well as their purchasing behavior toward the product. The
look of the website is an important aspect in improving information perception,
which allows individuals to do better cognitive mapping and evaluations of decision-
making. It can be explicitly claimed that graphical representations such as symbols,
colors, photos, and animations offer websites more vibrancy. This information may
increase people’s pleasure with the website and their navigational experiences.
As a result, a large portion of the literature emphasizes the importance of factors
such as acceptable appearance, simplicity of navigation, convenience of use, security
and privacy, and content information. These variables influence customer behavior.
In an e-commerce context, website success is critical, yet the lack of consensus in the
studies on how those factors should be addressed appears unusual. Therefore, it
appears that a set of principles should be defined in order to construct interfaces that
satisfy both user and business requirements.
4 Usability
Usability is a quantitative, or measurable, statement of how easy it is for users to
perform the activities for which a web design is created. As presented in the article by
Lisa et al. (2014), usability is defined as “the efficacy, efficiency, and happiness with
which specific users achieve specific goals in specific situations” [2]. Correspondingly,
the factors that determine a site’s usability must include error-free location of
information, ease of e-commerce transactions, user/customer satisfaction,
remembering the organization of the site, and other functionalities.
Effectiveness and efficiency, as defined by D. Reddig et al. at a 2008 conference,
are as follows:
“The precision and completeness with which certain users may attain specified
goals in specific circumstances” characterizes effectiveness.
“The resources invested in proportion to the precision and completeness of goals
attained” is how efficiency is defined.
Navigability and interactivity are also important aspects of website usability.
Other important factors that affect novice users are efficiency, errors,
satisfaction, easily memorable activities, and learnability.
Natural content grouping, presentation, and control uniformity throughout the site,
with clear and meaningful labels, as well as contextual navigation (where and how a
user can get the desired product), are other factors that impact users.
Care should also be taken with page arrangement: instead of loading too many items
onto a page, related items can be grouped to attract user interest.
5 A Case Study of Totally Blind Persons Interacting
on the Internet
Persons who are fully blind have more difficulty undertaking specific tasks than
people who have other sensory abnormalities such as poor vision, motor, or hearing
impairments [3]. Petrie et al. published the results of accessibility testing of 100
websites with users with visual, motor, and perceptual problems, indicating that
websites that are accessible to persons with varied abilities may still be aesthetically
appealing. In total, 100 websites from five different businesses were evaluated
using automated verification and user testing, with 51 persons of various abilities,
including 10 fully blind people. The average task success rate was 76%, but when
only the completely blind were considered, it dropped to 53% (the lowest score of
all the user categories) [4]. Similarly, the authors discovered that the blind had more
difficulty with user satisfaction than other differently-abled users [5] (4.2 on a 1-to-7
Likert scale, the lowest score of all the user groups). Researchers from Manchester
Metropolitan University evaluated a group of blind and visually impaired users who
undertook four information-seeking tasks, including the use of search engines, in
order to highlight non-visual access issues. Visually challenged persons spend 2.5
times longer than sighted users to search the Internet for a specific piece of informa-
tion. When given a set of assignments, blind respondents took twice as long as sighted
users to analyze search results and three times as long to browse the linked internet
sites [6]. The three non-technical requirements in WCAG 1.0 have been developed
into nine particular concepts for designing better text user interfaces (ETI). The
authors Maria Claudia Buzzi et al. [4] demonstrated that the ETI guidelines improve
usability by evaluating the efficiency, errors, and user satisfaction of a web user inter-
face developed according to these specific guidelines regarding GUI conformance
to standard; the study involved 39 blind users who were asked to complete two tasks
[4]. In the following section of this chapter, we shall concentrate our discussion on
the visually impaired, and especially on the needs of the completely blind, because
the authors specialize in this area.
A screen reader and a voice synthesizer are the software required by blind computer
users: they scan the information on a given website and vocalize the particular
information required. The visually challenged find it easier to navigate using the
arrow keys, tab key, and other access keys on a specially designed keyboard than to
use mouse pointers for scrolling, pointing, selecting, and so on. They find it more
advantageous to use a voice synthesizer to give and receive instructions and to browse
the content of a website. So, utmost care should be taken while designing a website
to make it more user-friendly through keyboard access.
Users may have navigational challenges even when websites fulfil accessibility
rules. In general, a web page may include a variety of access options, such as tables
and drop-down menus for extra information or multiple options. This may create a
problem for challenged persons who access the website using special keyboards,
screen readers, voice synthesizers, etc. [2] (Fig. 6).
Fig. 6 Pie chart technologies preferred by blind users
Overloaded information may prove hazardous to challenged persons, as they have
to stop and start their screen readers frequently.
While browsing via a screen reader, the user may miss the general context, as the
screen reader can read only bits of information.
The text connected with a link will appear on the Braille display or be played by the
synthesizer (e.g., “.PDF”, “additional information”, and so on); however, he or she is
unaware of what is written before and after it [4]. As a result, the reading procedure
may need to be repeated.
Mixing up content and structure also proves to be an issue for the blind reader. As
they arrive in the code, the screen reader announces the most significant interface
components, such as links, graphics, and window objects. These elements are critical
for understanding the page structure; however, the actual reading process might be
taxing for the user, necessitating a lot of mental work [7].
As UI aspects are difficult to comprehend, the links, content, and button labels
should be self-explanatory and context-independent.
Finally, visual content generally cannot be accessed by a blind person. As a result,
the various elements of visual content, such as captioning and video conferencing,
may be provided with extra tools, such as audio links, for the effective communication
of information.
User interface architecture and organization are critical for users working with
assistive technology. Because websites (or software windows) are built for visual
engagement, it is challenging for visually challenged individuals to navigate the
Web.
Developers should be mindful of how material is perceived through a screen reader
in order to build user interfaces that address or lessen these problems by focusing on
the page layout. Care should be taken to ensure that the information can be easily
accessed with a screen reader. Additional information may be given where photos are
displayed for general users, to suit the needs of the visually challenged. Every item
should be properly labelled for easy navigation. The ability of a blind user to navigate
a website broken down into logical pieces may improve their experience in two ways:
it gives a page overview and allows them to go from section to section. Using proper
heading levels helps the visually challenged with navigation, as screen readers have
particular instructions for moving from one heading to the next.
6 Conclusion
The rapid growth of the Internet in recent years has been accompanied by an intensely
competitive environment. This study focuses on specialist literature and empirical
data on the primary characteristics that influence a company’s level of success in
internet commerce. In particular, it is possible to emphasize factors connected to
website design.
The first research question of this study was to identify the primary characteristics
that influence the success of e-commerce websites from the perspective of consumers.
The literature studied leads us to conclude that web design is an important aspect in
achieving favorable results, since it has an impact on users and on online customers’
perceptions and behaviors. As a result, website design is an excellent
foundation for online businesses to build customer happiness, confidence, and posi-
tive intentions toward the website. We’ve focused on the interaction between usability
and web design in particular to help all kinds of users to navigate the website, giving
them control over their own tasks and a sense of freedom. As a result, web design
plays a crucial role in the success of a website. In addition, we have also high-
lighted a number of examples of good design strategies in order to identify the key
characteristics of a successful web design in the digital store.
Nonetheless, it is reasonable to assert that there is no single optimum design, because
it differs based on the type of goods being sold, the person in front of the computer,
and the geographic location from which the website is accessed. Design investment
is necessary to instill confidence in consumers’ minds, resulting in greater online
purchase intentions. Furthermore, privacy and security must be taken into account at
all times and in all areas of the website. In order to establish the crucial components
for reaching high levels of online-firm success, this study proposes a Decalogue in
this area. We have thus created a set of principles that can help e-commerce websites
look better. This Decalogue might be quite useful for web designers in establishing
the most important factors to consider while developing a website. We have found
that online users’ viewpoints must be emphasized in every part of the website design;
consequently, the dimensions outlined in this set of suggestions serve as the
foundation for influencing online users’ perceptions and behaviors, and for defining
the success of a website.
To begin, it seems reasonable to evaluate the visual attractiveness of the website, as
it will influence the consumer’s relationship with the firm. Secondly, the user’s
ability to navigate through the website should be focused upon, which increases the
site’s ease of use and the feeling of freedom while navigating.
Finally, e-business must carefully manage the information and contents of their
websites, providing high-quality data in appropriate formats. Designers should
attempt to homogenize the steps of the buying process so that people can better
understand and comprehend the commercial process on the Internet and feel more
secure about purchasing a product. Many web designers have been led astray more
by technical issues than by user needs, resulting in the creation of overly complex
websites. Despite the fact that these technologies have an influence, it will always be
necessary to follow basic design principles, such as creating a pleasant atmosphere
for clients, to give the website a trustworthy design and an experience that makes it
easy to use and explore.
To conclude, this chapter examined a number of important issues for ensuring that
websites work properly and that everyone has access to the Internet. After a review
of the many types of disabilities and their impacts, the concepts of assistive
technology, accessibility, and usability were studied in connection with each other,
so that they can be applied by a diverse group of individuals, regardless of age or skill.
References
1. Flavián C, Gurrea R, Orús C (2009) The effect of product presentation mode on the perceived content and continent quality of web sites. Online Inf Rev
2. Leporini B (2008) Evaluating a modified Google user interface via screen reader. Univ Access Inf Soc, 09/2008
3. Akhter F, Buzzi MC, Buzzi M, Leporini B (2009) Conceptual framework: how to engineer online trust for disabled users. In: 2009 IEEE/WIC/ACM international joint conference on web intelligence and intelligent agent technology
4. Buzzi MC, Buzzi M, Leporini B (2010) Accessibility and usability of web content and applications, Chap. 4. IGI Global
5. Buzzi MC, Buzzi M, Leporini B, Akhter F (2010) Is Facebook really “open” to all? In: 2010 IEEE international symposium on technology and society
6. Andronico P, Buzzi M, Leporini B, Castillo C (2006) Testing google interfaces modified for the blind. In: Proceedings of the 15th international conference on World Wide Web—WWW ’06
7. Leporini B (2007) Learning by e-Learning: breaking down barriers and creating opportunities for the visually-impaired. In: Lecture notes in computer science
8. Witt H (2008) Human-computer interaction, Chap. 4. Springer Science and Business Media LLC
Earlier Selection of Routes for Data
Transfer In Both Wired and Wireless
Networks
S. NagaMallik Raj , S. Neeraja , N. Thirupathi Rao ,
and Debnath Bhattacharyya
Abstract In both wired and wireless communication networks, the transfer of huge
volumes of data is a major issue. To solve this issue, we are working with both
admission control and control transfer mechanisms so as to transfer huge amounts of
data over current networks without disturbing the regular flow of data. A scheduling
algorithm is also being developed to achieve this goal of transferring substantial
amounts of data from source to destination securely. In the current system, for every
bulk transfer, a request needs to be sent to the central processing control asking for
some bandwidth to be blocked, giving the starting and ending times for the transfer.
Once the time slot is granted, any amount of data can be transferred within that slot.
Bandwidth reassignment and multiple routing configurations are present and utilized
in the system. As per the results, the performance of the system is excellent, and
encouraging results were obtained.
Keywords Advance reservations · Bandwidth allocation · Time slot · Routing ·
Multiple routing · Wired networks · Wireless networks · Bulk transfers
1 Introduction
The progress of communication networking, together with advances in computing
and storage, is drastically changing the way science is done. The term e-science has
S. NagaMallik Raj (B) · N. T. Rao
Department of Computer Science & Engineering, Vignan’s Institute of Information Technology
(A), Visakhapatnam, Andhra Pradesh, India
e-mail: mallikblue@gmail.com
S. Neeraja
Department of Computer Science & Software Engineering, Lendi Institute of Engineering and
Technology, Jonnada, Vizianagaram, Andhra Pradesh, India
D. Bhattacharyya
Department of Computer Science & Engineering, Koneru Lakshmaiah Education, Vaddeswa-
ram, Guntur, Andhra Pradesh 522502, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_6
emerged to describe large-scale science conducted through worldwide collaborations
enabled by networks. This is made possible by access to large-scale data collections,
computing resources, and high-performance visualization. Widely cited e-science
cases include high-energy nuclear physics, radio astronomy, geosciences, and climate
studies. The requirement for transporting large volumes of data in e-science has been
well argued [1–3].
To address the needs of e-science, the current paper considers admission control,
control transfer, and scheduling algorithms for high-bandwidth data transfers in
networks. The outcomes not only advance the knowledge and techniques in this area
but also complement protocol, architecture, and framework efforts. The admission
control, control transfer, and scheduling algorithms considered handle two classes of
jobs: bulk data transfers and jobs requiring a minimum bandwidth guarantee.
To transfer bulk amounts of data through the networks, we need to concentrate
on two points: the timing of the data transfer and the volume of data to be transferred.
They are discussed as follows:
1. On-request mode: data is transferred from source to destination whenever a user
requests a bulk transfer; the transfer starts as soon as the request is made.
2. In-advance mode: if a bulk transfer is known about in advance, this option is
chosen; we block (reserve) a slot for the transfer ahead of time, i.e., we make a
reservation of time slots for the transfer of data from source to destination.
2 Related and Proposed Work
The existing framework allows the admission of new requests as well as bandwidth
reallocation for existing jobs, without violating the end-time requirements of the
current jobs. First, it reassigns the current flows of old jobs to future time slices and
re-optimizes the system towards its optimal limit. The bandwidth of existing jobs can
be reallocated in the single-link case but not in the network case; the routes and
transfer rates of existing jobs are unchanged [4, 5].
In the proposed framework, admission control, control transfer, and scheduling
algorithms handle two classes of jobs, including bulk data transfer. A bulk transfer is
not sensitive to network delay but may be sensitive to the delivery deadline; it is
helpful for distributing high volumes of scientific data, which currently often depends
on ground transportation of the storage media. The MBG (minimum bandwidth
guarantee) class is helpful for real-time rendering or visualization of data remotely.
In our system, the algorithms for handling bulk transfers also contain the fundamental
elements of those for handling the MBG class; hence, we focus only on bulk transfers.
One distinguishing feature of this investigation is that each job request can be made
in advance. If a job is admitted, as determined by the admission control algorithm,
the system ensures that it will complete the data transfer for the job before the
requested end time. The challenge is how to give this guarantee while maintaining
efficient use of network resources and keeping the request rejection ratio low. The
process of determining the path of a data transfer is known as scheduling [6]. The
outcome is greatly enhanced efficiency in network resource use. Developing
comparable protocols and adding new components to the current toolbox in support
of our algorithms are among the future tasks [7].
In this paper, we present admission control, control transfer, and scheduling
algorithms for high-bandwidth data transfers in research networks. The outcomes not
only advance the knowledge and methods in this area but also complement protocol,
architecture, and infrastructure efforts. Progress has been observed in support of
e-science and grid computing by providing more efficient network resource
reservation and management algorithms. The current admission control and
scheduling algorithms handle two classes of jobs: bulk data transfers and those that
require a minimum transfer-speed guarantee. A huge data transfer is not sensitive to
network delay but may be sensitive to the delivery deadline, and it is valuable for
circulating high volumes of scientific data, which at present frequently depends on
ground transportation of the storage media. The process of determining the path of a
data transfer is known as scheduling [4, 8–11].
3 Modules Description
The main modules of the current article are as follows.
Functional Requirements
3.1 Path Reservation
In this component, the customer attempts to reserve a path by supplying the
route-booking details: start time, end time, date, and path. Before reserving the path,
the admission control and control transfer module checks whether the specified path
is already reserved. The booking scheme for bulk transfer checks the booked job
times against the requested start and end times for the task and tries to find a path
that can accommodate the entire job in that interval. In the reservation form, it collects
the start time, end time, date, destination, and source to check availability, and then
establishes the time slice with bandwidth and a transfer estimate; otherwise, it asks
the client to provide a valid time duration [12, 13]. A minimal sketch of such an
availability check follows.
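A minimal sketch of the described interval-based availability check (a hypothetical illustration, not the authors’ implementation; path names and capacities are placeholders):

```python
from dataclasses import dataclass

@dataclass
class Reservation:
    path: str         # placeholder route identifier, e.g. "A-B-C"
    start: float      # reservation start time (hours)
    end: float        # reservation end time (hours)
    bandwidth: float  # reserved bandwidth (Mbps)

CAPACITY = {"A-B-C": 100.0, "A-D-C": 100.0}  # per-path capacity (Mbps)
reservations = []

def available_bandwidth(path, start, end):
    """Conservative check: subtract every booking that overlaps [start, end)."""
    used = sum(r.bandwidth for r in reservations
               if r.path == path and r.start < end and start < r.end)
    return CAPACITY[path] - used

def admit(req):
    """Admission control: accept the request only if the path can carry it."""
    if available_bandwidth(req.path, req.start, req.end) >= req.bandwidth:
        reservations.append(req)
        return True
    return False  # rejected; the caller may try another path or time slot

print(admit(Reservation("A-B-C", 9.0, 11.0, 60.0)))   # True: path is free
print(admit(Reservation("A-B-C", 10.0, 12.0, 60.0)))  # False: overlap exceeds capacity
```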
3.2 Minimum Bandwidth Allocation
Admission control, control transfer, and scheduling algorithms handle two classes of
jobs: bulk data transfers, and jobs that are guaranteed a minimum bandwidth on a
specific path for a specific time, so that their data can be moved to the destination in
time; this reserved capacity cannot be used by others. The need for effective network
resource utilization is particularly significant with regard to advance bookings and
large file sizes or long-lived streams. As has been argued, there is an unfortunate
phenomenon known as bandwidth fragmentation. The simplest case of bandwidth
fragmentation occurs when the interval between the end time of one job and the start
of another is not long enough for any other job request (see the toy example below).
A bulk transfer request may optionally specify a minimum bandwidth as well as a
maximum bandwidth. More parameters can be included if necessary, such as an
expected range for the requested amount when the exact data size is unknown.
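A toy numerical illustration of bandwidth fragmentation (hypothetical numbers, not from the paper):

```python
# A link is booked until t=2 h and again from t=3 h, leaving a one-hour gap.
# Every pending request needs more than one hour, so the gap is wasted
# capacity even though the link sits idle for that hour.
gap_start, gap_end = 2.0, 3.0      # idle interval between two booked jobs
pending_durations = [1.5, 2.0]     # hours needed by waiting requests
gap = gap_end - gap_start
fragmented = all(d > gap for d in pending_durations)
print(f"gap of {gap} h is unusable: {fragmented}")  # True
```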
3.3 Time Management
From the job’s point of view, it is desirable to have a shorter response time. Each job
request can be made in advance and can specify a start time and an end time. The
bandwidth allocated to a specific path of a job stays constant for a whole time slice,
but it may change between slices, which in turn changes the transfer time. This means
the transfer may finish within the allotted time, and the remaining reserved time can
then be released to other (open) clients.
Rejection ratio: this is the ratio of the number of jobs rejected to the total number
of job requests. From the network’s viewpoint, it is desirable to admit as many jobs
as possible under the circumstances; from the client’s viewpoint, it is desirable to
have a small chance of job rejection [14].
3.4 Admission Control
To address the needs of e-science, this paper considers admission control as well as
scheduling algorithms for high-bandwidth data transfers in research networks. The
admission controller is a centralized component that looks after job bookings and
transfers: it gathers reservation requests from the nodes, checks for availability, and
then accepts or rejects the job. To improve network resource usage and reduce the
job rejection ratio, the network controller takes care of the optimization problems
involved in making admission control and scheduling decisions [15, 16].
Fig. 1 The default graph, showing the Send and Reserve buttons
Non-Functional Requirements
Adaptability: This is an appealing property of a structure, a system, or a procedure,
showing its ability either to handle growing amounts of work gracefully or to be
readily expanded. A corresponding meaning applies to a company, where adaptability
implies that the fundamental strategy offers the potential for growth within the
organization.
The architecture model of the system under consideration is explained in detail in
Fig. 1. The admission control mechanism and the control transfer process used to
perform bulk transfers of data in both wired and wireless networks are shown in
Fig. 2 (Table 1).
4 Test Cases
The current system was tested on various cases to analyze the performance of the
proposed approach. The system was tested on roughly 10 cases; the results are
displayed in tabular form and discussed in Table 1.
5 Conclusion
The proposed techniques aim to contribute to the administration and route allotment
of research networks for data-intensive e-science. The requirement
Fig. 2 The admission control and control transfer, showing all the reserved paths at various times
in the database
for substantial file transfers over high-bandwidth, low-latency network paths is
discussed in detail in the paper. The selected lists of routes can be identified and used
for sending and receiving emergency data, for future applications, and for other sets
of applications in service to society and the public. The opportunity lies in the fact
that research networks are substantially smaller in size than public ones. This work
consolidates the following novel components into a solid system of admission
control, control transfer, and flow scheduling: advance bookings for bulk transfers
and minimum-bandwidth guaranteed traffic, multipath routing, and bandwidth
reassignment by means of periodic re-optimization. Booking or blocking some set of
routes helps in sending emergency data to end users whenever the need arises. In this
regard, freely available slots and less-utilized routes are selected and identified from
the total list of routes; these remain available for data packets to be transmitted from
sender to receiver. To deal with the start- and end-time requirements of advance
bookings, the system identifies a reasonable family of discrete time-slice structures,
namely the consistent slice structures.
6 Future Enhancement
In our system, we utilized centralized control (admission control as well as control
transfer) to reserve the route. In the future, a bottleneck problem might
Table 1 Various test cases (columns: module, test scenario, test case ID, description, test data, steps, expected result, actual result)

Test case 1. Module: Reservation. Scenario: verify the buttons and text boxes for reservation. Description: ensure that the username and password fields are present. Test data: the Advance application should be available. Steps: (1) open the application; (2) verify that the buttons and text boxes are available. Expected result: the login form should appear successfully. Actual result: the login form was successful.

Test case 2. Module: Reservation. Scenario: verify the graph of the system. Description: ensure the creation of the graph of the system. Test data: the Advance application should be available. Steps: (1) open the application; (2) verify that the application navigates to the next page. Expected result: the application should navigate to the next page. Actual result: the application navigated to the next page.

Test case 3. Module: Reservation. Scenario: verify the node check list. Description: ensure that the list of nodes for the destination is present. Test data: the Advance application should be available. Steps: (1) open the application; (2) verify whether the list box of destination nodes is available. Expected result: the list box of destination nodes should be available. Actual result: verified the list box of destination nodes.

Test case 4. Module: Reservation. Scenario: verify the start time, end time, and date. Description: ensure the start time and end time for the messages to reach the destination. Test data: the Advance application should be available. Steps: (1) open the application; (2) select the start time, end time, and date; (3) verify the start time, end time, and date selected at the time of reservation. Expected result: the list boxes for start time, end time, and date should be available. Actual result: verified the list boxes for start time, end time, and date.

Test case 5. Module: Reservation. Scenario: verify the availability check. Description: ensure whether the path is available. Test data: the Advance application should be available. Steps: (1) open the application; (2) click on "check availability"; (3) verify whether the path is available. Expected result: the reserved path should be shown as available. Actual result: verified that the reserved path is available.

Test case 6. Module: Reservation. Scenario: verify the reserve button. Description: ensure that clicking the reserve button displays a window and that the path reservation succeeds. Test data: the Advance application should be available. Steps: (1) open the application; (2) click on the reserve button; (3) verify whether the path reservation succeeds. Expected result: the reservation should be verified. Actual result: verified the reservation.
occur; to solve this problem, we can save the reservation data in the appropriate router
(in a distributed manner). Likewise, we can make use of other efficient scheduling
algorithms.
References
1. Satyanarayana KV, Rao NT, Bhattacharyya D, Hu (2022) Identifying the presence of bacteria on digital images by using asymmetric distribution with k-means clustering algorithm. Multidimension Syst Signal Process 33(2):301–326. https://doi.org/10.1007/s11045-021-00800-0
2. Chandra Sekhar P, Thirupathi Rao N, Bhattacharyya D, Kim T (2021) Segmentation of natural
images with k-means and hierarchical algorithm based on mixture of pearson distributions. J
Sci Ind Res 80(8):707–715. Retrieved from www.scopus.com
3. Bhattacharyya D, Dinesh Reddy B, Kumari NMJ, Rao NT (2021) Comprehensive analysis on
comparison of machine learning and deep learning applications on cardiac arrest. J Med Pharm
Allied Sci 10(4):3125–3131. https://doi.org/10.22270/jmpas.V10I4.1395
4. Bhattacharyya D, Doppala BP, Thirupathi Rao N (2020) Prediction and forecasting of persistent
kidney problems using machine learning algorithms. Int J Current Res Rev 12(20):134–139.
https://doi.org/10.31782/IJCRR.2020.122031
5. Mandhala VN, Bhattacharyya D, Vamsi B, Thirupathi Rao N (2020) Object detection using
machine learning for visually impaired people. Int J Current Res Rev 12(20):157–167. https://
doi.org/10.31782/IJCRR.2020.122032
6. Bhattacharyya D, Kumari NMJ, Joshua ESN, Rao NT (2020) Advanced empirical studies on
group governance of the novel Corona virus, Mers, Sars and Ebola: a systematic study. Int J
Current Res Rev 12(18):35–41. https://doi.org/10.31782/IJCRR.2020.121828
7. Asish Vardhan K, Thirupathi Rao N, Naga Mallik Raj S, Sudeepthi G, Divya Bhattacharyya
D, Kim T (2019) Health advisory system using IoT technology. Int J Recent Technol Eng
7(6):183–187. Retrieved from www.scopus.com
8. Eali SNJ, Bhattacharyya D, Nakka TR, Hong S (2022) A novel approach in bio-medical image
segmentation for analyzing brain cancer images with U-NET semantic segmentation and TPLD
models using SVM. Traitement Du Signal 39(2):419–430. https://doi.org/10.18280/ts.390203
9. Doppala BP, NagaMallik Raj S, Stephen Neal Joshua E, Thirupathi Rao N (2021) Automatic
determination of harassment in social network using machine learning. https://doi.org/10.1007/
978-981-16-1773-7_20 Retrieved from www.scopus.com
10. Eali SNJ, Rao NT, Swathi K, Satyanarayana KV, Bhattacharyya D, Kim T (2018) Simulated
studies on the performance of intelligent transportation system using vehicular networks. Int J
Grid Distrib Comput 11(4):27–36. https://doi.org/10.14257/ijgdc.2018.11.4.03
11. Joshua ESN, Battacharyya D, Doppala BP, Chakkravarthy M (2022) Extensive statistical analysis on novel coronavirus: towards worldwide health using apache spark. https://doi.org/10.1007/978-3-030-72752-9_8. Retrieved from www.scopus.com
12. Joshua ESN, Bhattacharyya D, Chakkravarthy M (2021) Lung nodule semantic segmentation
with bi-direction features using U-INET. J Med Pharm Allied Sci 10(5):3494–3499. https://
doi.org/10.22270/jmpas.V10I5.1454
13. Joshua ESN, Bhattacharyya D, Chakkravarthy M, Kim H (2021) Lung cancer classification using squeeze and excitation convolutional neural networks with grad cam++ class activation function. Traitement Du Signal 38(4):1103–1112. https://doi.org/10.18280/ts.380421
14. Joshua ESN, Chakkravarthy M, Bhattacharyya D (2021) Lung cancer detection using improvised grad-cam++ with 3D CNN class activation. https://doi.org/10.1007/978-981-16-1773-7_5. Retrieved from www.scopus.com
15. Neal Joshua ES, Bhattacharyya D, Chakkravarthy M, Byun Y (2021) 3D CNN with visual
insights for early detection of lung cancer using gradient-weighted class activation. J Healthc
Eng. https://doi.org/10.1155/2021/6695518
16. Neal Joshua ES, Chakkravarthy M, Bhattacharyya D (2020) An extensive review on lung
cancer detection using machine learning techniques: a systematic study. Revue d’Intelligence
Artificielle 34(3):351–359. https://doi.org/10.18280/ria.340314
Identifying River Drainage
Characteristics by Deep Neural Network
Vithya Ganesan, Tejaswi Talluru, Manoj Challapalli, and Chandana Seelam
Abstract This work supports environmental protection and sustainable development
by managing a network of reservoirs and canals and identifying inner water links
under a river. The river’s width, speed, flow, and longitudinal images are continuously
monitored and analyzed with time-series and AIoT techniques to predict the river’s
path and trace the direction of its inner and outer flow, while also obtaining
predictions of data and images on soil alleviation and erosion. A framework is
developed to extract the significant features of river/drainage images from
high-resolution multispectral satellite imagery, to identify river drainage
characteristics such as inner water links, to predict the river path from its width and
longitude, and to compare images before and after natural calamities. The
multispectral images are analyzed to develop digital elevation maps of river drainage
features and to provide guidance for disaster preparedness.
Keywords River feature extraction · River topology · Deep learning · River
skeleton · River models
1 Introduction
Defining and testing various drainage images using multispectral imagery is a real challenge in deep learning. The multispectral images are analyzed to identify river drainage width, depth, and angle of elevation, and to characterize the river drainage by its capacity, latitude, longitude, and volume. In the proposed model, deep image analysis is employed to generate a drainage map with its volume and to identify drainage deviations before and after a natural calamity. The motivation and objective of this work is to deploy a framework that extracts river drainage features from high-resolution multispectral images, improving the study of the evolution of drainage characteristics and identifying the nature and structure of rocks, river topography, and land slope.
V. Ganesan · T. Talluru · M. Challapalli (B) · C. Seelam
CSE, Koneru Lakshmaiah Education Foundation, Guntur, Andhra Pradesh 522302, India
e-mail: manojchallapalli93@gmail.com
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_7
Multispectral images are the main source of information for generating and updating topographic river and drainage data. The extraction of rivers is an application that deserves considerable attention, especially in large areas of vegetation and cultivation.
Identifying the different drainage patterns from high-resolution images gives more clarity to the skeleton of the drainage system. Continuous monitoring of drainages by width, depth, and elevation angle, correlated with climatic conditions and natural calamities, requires a new analysis tool. A suitable AI and deep learning pipeline will be helpful to analyze the drainage flow and path.
2 Related Work
Erosion and sediment deposition by rivers and streams pave remarkable passages [1]. The world economy is directly linked with rivers, which enhance the quality of life [2]. A hybrid intelligent framework is developed for extracting river drainage images to act as a road map for identifying regularities and irregularities, certainties, and uncertainties in the river/drainage flow path from hydrography images [3], as shown in Figs. 1 and 2.
2.1 Identifying Various Size and Shape of River Drainage
River curves accelerate the erosion process [4], and the river flow images formed by erosion are:
Horseshoe: U-shaped water bodies created by erosion from river flow.
Scars: sediment deposits shaped by the river's speed, volume, and perennial flow.
Sandbar, side bar, and scroll bar: characterized by the deposition of sediment along the river path (for example, the Krishna riverside sand and scroll bars).
Cut bank, relic channels, and secondary channels: other types of sediment scars.
Delta: the river divides into several smaller branches that form a triangular area.
Drainage patterns: the drainage pattern varies with land elevation, rock types, and geologic structures [5].
Fig. 1 River/drainage flow path
Fig. 2 River/drainage hydrography
The river/drainage issues are delved into in the following phases, which help to estimate the width and longitude of the river [6]. Consequently, from the literature survey, an engineering approach is required to identify the river elevation and drainage characteristics, and the river environment is examined through visual localization and a knowledge-based framework of the river [7]. The work is segmented into the following phases: (1) identifying the various sizes and shapes of river drainage; (2) generating river drainage image resolutions with spatial, spectral, angular, and temporal models; (3) elevating a drainage map from the above models to predict the drainage characteristics. A knowledge-based framework invokes the following [8]: identify high-resolution multispectral river drainage images, classify the river drainage path flow by deep learning, and train with knowledge-based engineering to discern regularities and irregularities in the river drainage path and shape [9].
2.2 Generate River Drainage Image by Spatial, Spectral,
Angular, and Temporal Model
The appearance of the selected images is enhanced by applying the spatial, spectral, angular, and temporal models with a polyline-layers technique to sharpen the edges of features in river drainage images. The various river images identified and analyzed by the deep learning model under the spatial, spectral, angular, and temporal models are shown in Fig. 3.
Fig. 3 River drainage image analysis by deep learning model
3 Results
Multispectral river drainage image: shapefiles and the pysheds library are used to identify the longitude, latitude, and elevation of the river flow, and the result is shown in Fig. 4. Using deep learning, the various sizes and shapes of river drainage are extracted, and the resultant image is shown in Fig. 5. Using the elevation data set, the river flow direction for the spatial model of the river drainage image is generated as a flow direction grid, shown in Fig. 6.
Fig. 4 Digital elevation map for river flow
Fig. 5 Drainage network
Fig. 6 River flow direction for spatial model of river image
The river flow accumulation for the spectral model of the image is generated as a flow accumulation grid from shapefiles and TIFF files, and it is shown in Fig. 7.
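The flow direction grid (Fig. 6), the flow accumulation grid (Fig. 7), and the delineated catchment (Fig. 12) can all be derived from a DEM with the pysheds library used here. The following is a minimal illustrative sketch in Python, not the authors' code; the file name dem.tif and the outlet coordinates are hypothetical.

# Minimal sketch: D8 flow direction, flow accumulation, and catchment
# delineation from a DEM with pysheds ('dem.tif' is a hypothetical raster).
from pysheds.grid import Grid

grid = Grid.from_raster('dem.tif')
dem = grid.read_raster('dem.tif')

# Condition the DEM so that every cell has a downslope neighbour
pit_filled = grid.fill_pits(dem)
flooded = grid.fill_depressions(pit_filled)
inflated = grid.resolve_flats(flooded)

fdir = grid.flowdir(inflated)        # flow direction grid (cf. Fig. 6)
acc = grid.accumulation(fdir)        # flow accumulation grid (cf. Fig. 7)

# Catchment draining to a hypothetical outlet (cf. Fig. 12)
x, y = 80.55, 16.51                  # outlet coordinates in the raster's CRS
catch = grid.catchment(x=x, y=y, fdir=fdir, xytype='coordinate')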
4 Discussion
In the angular model, the total river flow distance and depth are visualized using pysheds, as shown in Fig. 8. In the temporal model, river phenomena such as the percent impervious area are calculated with latitude and longitude on the x- and y-axes to identify temporal data about the water images, as shown in Fig. 9.
Fig. 7 River flow accumulation for Spectral Model of river image
Fig. 8 River flow direction in angular model
Soil texture analysis is required to identify the longitude and latitude of the riverbed, and it is helpful for tracking the river flow path and its deviations. Figure 10 shows the different colors of the soil texture types, and Fig. 11 shows the raster image of the soil texture analysis for diagnosing the riverbed. The river flow direction for the spatial model, the river flow accumulation for the spectral model, and the river flow direction in the angular model together show river phenomena such as length, breadth, and elevation. The temporal model of river phenomena, the soil texture analysis for riverbed identification, and the raster image of soil texture are used to ascertain the delineated catchment by comparing the images periodically. Figure 12 shows the delineated catchment of the river flow direction visualized from a sample data set of river elevation. If any anomaly is detected in the periodical comparison, it is inferred that a minor calamity has occurred near the riverbed.
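The periodical comparison can be reduced to a cell-wise difference test between two co-registered rasters. The sketch below is an assumed simplification of that anomaly check; the threshold value is illustrative.

# Minimal sketch: flag an anomaly when the mean absolute cell-wise change
# between two co-registered rasters exceeds a (hypothetical) threshold.
import numpy as np

def detect_anomaly(before, after, threshold=2.0):
    diff = np.abs(after.astype(float) - before.astype(float))
    return float(diff.mean()) > threshold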
Fig. 9 Temporal model of river phenomenon
Fig. 10 Soil texture analysis for riverbed identification
5 Conclusion
Morphological assessment of the river is necessary for environmental planning, as it is used to understand the current situation and possible route changes. Normally, river images are extracted by preprocessing and analyzed with filtering, smoothing, and delineation techniques. In addition, multidimensional river images are improved by averaging a sequence of images with their depth, latitude, and longitude, implemented through the different models to predict the flow and direction of the river. This is used to generate a drainage map with its volume and to identify the drainage deviations before and after a natural calamity.
Fig. 11 Raster image of Soil texture
Fig. 12 Delineated catchment after the minor calamity
References
1. Rao YKN, Saito K, Ch V, Kumar S, Kubo S, Pandey Z, Li G, Demudu, Rajawat AS. Holocene evolution and Anthropocene destruction of the Krishna Delta on the east coast of India: delta lobe shifts, human impacts, and sea-level history. Mar Geol 106229
2. Precision mapping of boundaries of flood plain river basins using high-resolution satellite imagery: a case study of the Varuna river basin in. J Earth Syst Sci 128(4):105
3. Hansda S, Das VK, Debnath K (2022) Temporal modulation of turbulence structure over progressive erosion boundary under influence of wave current combined flow. Environ Fluid Mech
4. Li M, Wu B, Chen Y, Li D (2022) Quantification of river network types based on hierarchical structures. CATENA 211
5. Arévalo OJ, Colombera L, Mountney NP, Basilici G, Soares MVT (2022) Variations in water discharge at different temporal scales in a mud-prone alluvial succession: the Paleocene-Eocene of the Tremp-Graus Basin, Spain. Sediment Geol. https://doi.org/10.1016/j.sedgeo.2022.106122
6. Reddy RP, Srija K, Karthi SS, Geetha RP (2020) Evaluation of water body extraction from satellite images using open-source tools. In: Intelligent systems, technologies and applications. Advances in intelligent systems and computing, vol 910. Springer
7. Bajirao TS, Kumar P (2021) Geospatial technology for prioritization of Koyna River basin of India based on soil erosion rates using different approaches. Environ Sci Pollut 28:35242–35265
8. Wood DJ, Brown CRM, Doyle L, Smith H, Waller S, Jba EFW. Identification of river defences from digital terrain models using deep learning. Risk Management
9. https://unesdoc.unesco.org/ark:/48223/pf0000372985.locale=en
A Review on Optimal Deep Learning
Based Prediction Model for Multi Disease
Prediction
Aneel Kumar Minda and Vithya Ganesan
Abstract Healthcare data collection and processing is one of the most worrisome and troublesome methodologies to optimize. With the advent of the digital era and technological advancements, a vast quantity of multidimensional data on patients is created, including clinical factors, hospital resources, illness diagnostic information, patients' records, and medical equipment. This enormous, dense, and complex data must be processed and evaluated to extract knowledge for effective decision-making. Medical data mining offers great potential for uncovering hidden patterns in medical data sets. By identifying significant patterns and detecting correlations and relationships among many variables in huge databases, the use of various data mining tools and machine learning approaches has changed healthcare organizations. This review paper identifies what is important in medical data, presenting and comparing existing work to guide the future course of action.
Keywords Health care ·Health prediction ·Disease prediction ·Data science for
healthcare ·Medical care data analysis
1 Introduction
Technology combines multiple analytic methodologies with modern and complex algorithms, allowing for the exploration of massive amounts of data [1–4]. It is used in healthcare to gather, organize, and analyze patient data in a systematic manner. It may be used to identify inherent inefficiencies and best practices for providing better services, which may lead to improved diagnosis, better medicine, and more successful treatment, as well as a platform for a deeper knowledge of the mechanisms in practically all elements of the medical domain [5–10]. Overall, it assists in the
A. K. Minda (B)
International SOS, Dubai, United Arab Emirates
e-mail: mak.msbi5@gmail.com
V. Ganesan
CSE, Koneru Lakshmiah Education Foundation, Guntur, India
e-mail: vithyaganesan@kluniversity.in
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_8
early detection and prevention of disease epidemics by searching medical databases
for pertinent information. The process of determining a condition based on a person’s
symptoms and indicators is known as medical diagnosis. In the diagnostic process,
one or more diagnostic procedures, such as diagnostic tests, are performed [11–13]. Diagnosis of chronic illnesses is a vital issue in the medical industry, since it is based on many symptoms; it is a complex procedure that frequently leads to incorrect assumptions. When diagnosing illnesses, the clinical judgment is
based mostly on the patient’s symptoms as well as the physicians’ knowledge and
experience [14–19]. Furthermore, when medical systems evolve and new treatments
become available, it becomes more difficult for physicians and doctors to stay up
with the current innovations in clinical practice. For effective therapy, medical prac-
titioners and doctors must be well-versed in all pertinent diagnostic criteria, patient
history, and a mix of medication therapy. However, mistakes are possible since they
make judgments instinctively based on information and experience gained from past
experience with patients. Because of factors such as multitasking, restricted analysis,
and memory capacity, their cognitive capacities are restricted [20, 21]. As a result, it
is difficult for a physician to make the right judgment on a consistent basis if he is not supported by clinical tests and patient history information. Even experienced physicians can benefit from a computer-aided diagnostic system in making sound medical judgments [22–25]. Thus, medical professionals are very interested in automating the
diagnosis process by integrating machine learning techniques with physician exper-
tise. Data mining and machine learning approaches are making significant efforts to
intelligently translate accessible data into valuable information in order to improve
the diagnostic process’s efficiency. Several studies have been conducted to explore
the use of machine learning in terms of diagnostic abilities. It was discovered that,
when compared to the most experienced physician, who can diagnose with 79.97% accuracy, machine learning algorithms could identify with 91.1% correctness [6, 26–28, 30]. Machine learning techniques are explicitly applied to illness datasets to extract
features for optimal illness diagnosis, prediction, prevention, and therapy.
2 Literature Review
In 2019, Usama et al. [29] implemented a self-attention-based recurrent convolutional neural network (RCNN) representation using "real-life clinical text data collected from a hospital in Wuhan, China". The proposed model learned elevated semantic features automatically from medical text using indirect associations surrounding the convolution. Because the clinical text also had limitations, the RCNN's capability with the self-attention method was examined: through the self-attention model, the convolved features with the most significance in the clinical text were focused on by measuring the probability of each convolved feature with softmax. The model was estimated on the dataset, and metrics such as accuracy and recall were analyzed. The obtained outcomes proved the better
accuracy of the proposed model than numerous existing approaches in detecting the
disease of cerebral infarction.
In 2019, Jiang et al. [30] implemented a novel multi-task learning approach for the prediction of cognitive diseases and the identification of the major predictive biomarkers, based on correlation-aware sparsity and low-rank constrained regularization. The new multi-task learning algorithm was proposed for non-smooth convex optimization. The experiments were carried out on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset for evaluating the optimization.
Mainly, the baseline magnetic resonance imaging (MRI) features were used to predict the cognitive scores at numerous time points. The results showed the correctness and rationality of the proposed multi-task learning method, and the prediction of disease progression was reliable.
In 2020, Lei et al. [31] implemented a framework for predicting clinical scores based on longitudinal data at various time points. This framework involved three stages: "feature selection based on chronotropy regularized joint learning, feature encoding based on deep polynomial network, and ensemble learning for regression via the support vector regression method". For the prediction of scores, two scenarios were planned. In scenario 1, the prediction of the longitudinal scores was achieved from the baseline data, whereas scenario 2 obtained the predicted scores using all the previous data time points, which could improve the accuracy of the prediction scores; it is applied to resolve data incompleteness. Using the public ADNI database, the developed framework could efficiently show the association between the experimental scores and the MRI data.
In 2019, Khan et al. [1] applied network analysis and data mining procedures to hospital admission and discharge data for recognizing chronic patients' diseases and tracking comorbidity. Based on this, a chronic disease risk prediction framework was developed and tested with an "Australian healthcare context dataset" for the prediction of Type 2 Diabetes (T2D) risks. Some of the risk factors for the predictions are the clustering membership, comorbidity occurrences, and transition patterns obtained through social-network-based procedures and various graph theories. Moreover, the exploratory procedure was developed with three predictive methods: parameter optimization, regression, and classification trees. All three prediction approaches fed the graph theory, which had the highest ranking based on the "comorbidity prevalence" and "transition pattern match" scores. Finally, the overall prediction accuracy was enhanced by utilizing administrative data. In 2018, Hashem et al. [14] compared and estimated various ML techniques for predicting advanced-stage fibrosis using a combination of clinical information and serum biomarkers for classification system development. For the advanced fibrosis risk prediction model, various methods such as particle swarm optimization (PSO), DT, multilinear regression, and GA were developed. The proposed model performed better in producing inexpensive, numerical, and accurate outcomes in a real-time environment. This model was used for predicting
the advanced liver fibrosis from the correlation coefficients, attaining higher accuracy when compared to the DT and PSO algorithms.
In 2019, Mohan et al. [2] implemented an approach for forecasting using selected features with ML methods, which results in improved accuracy for cardiovascular disease prediction. The prediction model was developed with dissimilar feature combinations and numerous classification approaches. The proposed model produced an enhanced accuracy level with the hybrid random forest with a linear model (HRFLM), improving both the accuracy and the testing time. The proposed model passed the data records into two classes, support vector machine (SVM) and ANN, for the additional analysis process. A back propagation neural network (BPNN) classification approach was implemented, wherein the generation of the hypertension gene sequence occurred.
In 2019, Aliberti et al. [3] discussed the problem of automatic prediction of glucose levels from multi-patient data. By analyzing a multi-patient training set, the glucose prediction model was analyzed for predicting the upcoming glucose level of a new patient. The major contribution of this system was based on two processes: (1) discovering the prediction model from a group of CGMS data from a mixed group of diabetic patients, which probably improved the generalization ability and reduced the over-fitting risks of the model; (2) planning and evaluating diverse categories of prediction systems, with the prediction results analyzed from both the analytical and the clinical point of view.
In 2019, Almansour et al. [4] focused on applying various ML (Dahiwade et al. 2019) and classification algorithms to "a dataset of 400 patients and 24 attributes" linked with chronic kidney disease (CKD) diagnosis. Classification methods such as ANN and SVM were used for the experiments; all missing values in the dataset were substituted with the mean value of the corresponding attributes. Subsequently, the optimized parameters for the ANN and SVM were determined by varying the parameters. The proposed model was implemented using the best attained features and parameters. From the experimental results, the ANN presented better performance than the SVM, with higher accuracy.
3 Review on Traditional Multi-disease Prediction Models
In recent years, disease diagnosis based on computer-aided models built from clinical data with DL approaches has become an emerging and wide area of research. However, some challenges exist in this field that have to be considered most significant. A few of the major pros and cons are represented in Table 1. The RNN-based approach in [29] for multi-disease prediction attains high prediction accuracy and can handle data of any size, but the computational complexity is high due to the recurrent nature, and training an RNN is difficult when it operates in a large environment. Multi-task learning [30], applied for Alzheimer's disease prediction, attains consistent and efficient information and prevents overfitting; however, this method is hard to scale to large experiments.
It can handle only single-modality data. For Alzheimer's disease monitoring [31], the deep polynomial network and ensemble learning are helpful even for unstructured data, and this method can attain accurate predictions; still, the problem formulation is tough for large data sets, and when the network becomes deeper, optimization problems may occur. The data mining and network analysis techniques such as predictive training regression, tree classification, and parameter optimization [1] obtain good prediction accuracy and promote the quality of information; yet they face difficulties while handling real-time datasets, and more resources are required for large administrative datasets. The ML methods in [14] are efficient in attaining low computational time and are robust to the overfitting problem; high accuracy is maintained, and data are identified easily. However, debugging the problem is difficult in particle swarm optimization, which has a low convergence rate, and GA is computationally expensive. The proposed HRFLM [2] for predicting cardiovascular disease produces enhanced prediction and accuracy levels and improves the classification performance; compared to decision trees, however, it is hard to compute and is a time-consuming algorithm. CGMS prediction considers feed-forward networks (FNN) [3] and RNN: an FNN does not impose assumptions on the input data, and the networks are able to generalize from reading a few data sets; however, FNNs require lengthy training sessions, and practical issues may arise in RNN. For the prediction of CKD [4], the SVM effectively performs better than the ANN, and both methods store information on the entire network. Even though the SVM is efficient, it is not suitable for large applications, and its hardware dependence affects the performance. These drawbacks should be considered to motivate upcoming researchers to develop more advanced techniques for predicting multi-disease in health care.
Objectives.
To extract the most relevant features for training the deep learning classifiers.
To select the most relevant features for higher reliability and lower computational complexity.
To design a hybrid deep learning framework for precise disease prediction.
To fine-tune the hyperparameters of the deep learning model with optimization techniques for enhancing the prediction performance.
To introduce a new self-improved optimization model for enhancing the convergence of the solutions, thereby solving the optimization problem.
4 Methodology
Prediction of diseases is a major factor for medical organizations in making the best medical decisions. In medical treatment, wrong decisions may result in treatment delay or even death. Various disease prediction models have been studied, and many limitations affect the treatment. A major challenge in the medical sector is aggregating dissimilar, often asynchronous data sources into significant indicators of personal health. Earlier, healthcare professionals faced the
Table 1 Methodology and challenges

Author | Methodology | Features | Challenges
Khan et al. [1] | RNN | Can attain high prediction accuracy; RNN can handle any size of data | Computational complexity and training cost are high due to the recurrent nature
Mohan et al. [2] | Multi-task learning | Consistent and efficient information is attained through multi-task learning; it prevents overfitting | This method is hard to achieve for large experiments; it cannot perform on multi-modality data
Aliberti et al. [3] | Deep polynomial network and ensemble learning | Obtains more accurate predictions; very useful when the data is not in a particular structure | Problem formulation is tough for large datasets when the network becomes deep
Almansour et al. [4] | Predictive training regression, tree classification, and parameter optimization | Obtains good prediction accuracy and quality information | Difficult to handle real-time datasets; more resources are required for large administrative datasets
Zhao et al. [5] | PSO, DT, multilinear regression, and GA models | Efficient in attaining low computational time and robust to overfitting; high accuracy is maintained; identifies data easily | Debugging of the problem is difficult; low convergence rate; GA is computationally expensive
Wang et al. [6] | HRFLM | Produces an enhanced prediction and accuracy level | Compared to decision trees, it is hard to compute
Brand et al. [7] | FNNs and RNN | Does not impose assumptions on the input data; the networks can generalize the entire network by reading a few data sets | FNNs require lengthy training; practical issues may arise in RNN
Kumar et al. [8] | SVM | Stores information on the entire network; SVM achieves more accuracy than ANN | SVM is not suitable for large applications; its hardware dependence affects the performance
challenges of gathering and estimating the enormous amount of data for successful treatments and predictions because of fewer tools or technologies. For disease prediction based on existing approaches, fewer variables are considered, "such as age, weight, height, gender, and more". In contrast, the ML approach uses more variables, which is
based on computing devices. Thus, ML for disease prediction can attain better accu-
racy in the healthcare field. Prediction of future medical status is done by various
algorithms.
These algorithms help to construct models for data analysis and delivery of results using historical and real-time data. By using ML, healthcare professionals make improved assessments of patients' data for diagnosis and treatment choices, which leads to improved healthcare services. DL is the new and important progression of ML, applied for efficient extraction of important features from complex and huge datasets using hierarchical and stacked learning approaches. DL can provide improved performance in numerous sectors like speech recognition, natural language processing, and image recognition.
In this research work, a novel disease prediction model will be developed following four major phases: (1) data normalization, (2) feature extraction, (3) feature selection, and (4) prediction.
The proposed architecture for the multi-disease prediction framework uses a hybrid DL method. The datasets collected from benchmark sources for conducting the experiment are diabetes, hepatitis, lung cancer, liver tumor, breast cancer, COVID-19, heart disease, Parkinson's disease, and Alzheimer's disease; the processing is shown in Fig. 1.
In the feature extraction phase, the most relevant features such as statistical features (mean, median, and standard deviation), modified correlation, modified skewness, modified entropy, and technical-indicator-based features will be extracted. First, the collected dataset values will be applied to the pre-processing phase, where the values are normalized into the range 0–1. Data normalization is applied to systematize non-structured data into structured data; it is efficient for minimizing data redundancy and complexity, and data integrity is also enhanced. Further, the normalized attributes are employed for the feature extraction.
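As an illustration of these two phases, the sketch below implements min-max normalization and a plain statistical feature vector in Python; the standard skewness and histogram entropy stand in for the "modified" variants, which are not specified here.

# Minimal sketch: 0-1 normalization and statistical feature extraction.
import numpy as np
from scipy.stats import skew, entropy

def normalize_01(X):
    # Min-max normalize each column into the range 0-1
    mn, mx = X.min(axis=0), X.max(axis=0)
    return (X - mn) / np.where(mx - mn == 0, 1, mx - mn)

def statistical_features(x):
    # Mean, median, standard deviation, skewness, and histogram entropy
    hist, _ = np.histogram(x, bins=10, density=True)
    return np.array([x.mean(), np.median(x), x.std(),
                     skew(x), entropy(hist + 1e-12)])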
From the extracted features, the reliable features will be selected with an improved chi-square model. The disease prediction model will be designed with hybrid deep learning algorithms, an optimized Bi-GRU and quantumNet, respectively. The optimized Bi-GRU and quantumNet are trained with the appropriate features acquired from the improved chi-square model, as sketched below.
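A minimal sketch of this stage is given below: the standard sklearn chi-square test stands in for the improved chi-square model, the Bi-GRU layer sizes are illustrative, and quantumNet is omitted.

# Minimal sketch: chi-square feature selection followed by a Bi-GRU classifier.
import tensorflow as tf
from sklearn.feature_selection import SelectKBest, chi2

def select_features(X, y, k=16):
    # chi2 requires non-negative inputs, which the 0-1 normalization guarantees
    selector = SelectKBest(chi2, k=k).fit(X, y)
    return selector.transform(X), selector

def build_bigru(n_features):
    # Each selected feature is fed as one time step of length 1
    model = tf.keras.Sequential([
        tf.keras.layers.Input(shape=(n_features, 1)),
        tf.keras.layers.Bidirectional(tf.keras.layers.GRU(32)),
        tf.keras.layers.Dense(1, activation='sigmoid'),  # disease present/absent
    ])
    model.compile(optimizer='adam', loss='binary_crossentropy',
                  metrics=['accuracy'])
    return model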
To enhance the prediction accuracy of the proposed model, the weight function of the Bi-GRU will be fine-tuned with a new self-improved Honey Badger Algorithm (SI-HBA) model. This SI-HBA model will be a conceptual enhancement of the standard HBA [32] model. The HBA is inspired by the intelligent foraging behavior of the honey badger and mathematically develops an efficient search strategy for solving optimization problems. The dynamic search behavior of the honey badger, with its digging and honey-finding approaches, is formulated into exploration and exploitation phases in HBA. Moreover, with controlled randomization techniques, HBA maintains ample population diversity even toward the end of the search process.
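The sketch below is a simplified stand-in for the SI-HBA tuner, not the published HBA update equations: a population search whose step size decays over the iterations, mirroring the density-factor-driven shift from exploration to exploitation described above.

# Simplified population search with an HBA-style decaying density factor;
# fitness maps a candidate weight vector to a loss value to be minimized.
import numpy as np

def tune(fitness, dim, pop=20, iters=100, lo=-1.0, hi=1.0, seed=0):
    rng = np.random.default_rng(seed)
    X = rng.uniform(lo, hi, (pop, dim))        # candidate weight vectors
    best = min(X, key=fitness).copy()
    for t in range(iters):
        alpha = 2.0 * np.exp(-t / iters)       # decaying density factor
        for i in range(pop):
            step = alpha * rng.standard_normal(dim)
            cand = np.clip(best + step * np.abs(X[i] - best), lo, hi)
            if fitness(cand) < fitness(X[i]):  # greedy replacement
                X[i] = cand
                if fitness(cand) < fitness(best):
                    best = cand.copy()
    return best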
The optimized Bi-GRU and quantumNet will run in parallel and independently. The average of the final predicted outcomes from the optimized Bi-GRU and quantumNet is computed, and this forms the ultimate decision regarding the presence or absence of disease.
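The final decision step can be expressed in a few lines; the 0.5 threshold below is an assumption.

# Minimal sketch: average the two models' predicted probabilities and threshold.
import numpy as np

def ensemble_decision(p_bigru, p_quantumnet):
    avg = (np.asarray(p_bigru) + np.asarray(p_quantumnet)) / 2.0
    return (avg >= 0.5).astype(int)   # 1 = disease present, 0 = absent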
Fig. 1 Overview of disease prediction
5 Conclusion
The proposed model will be implemented in Python, and the experimental outcomes will be investigated. The performance of the proposed model will be compared with other state-of-the-art models in terms of Type I and Type II measures. Here, Type I measures are positive measures like accuracy, sensitivity, specificity, precision, negative predictive value (NPV), F1-score, and Matthews correlation coefficient (MCC), and Type II measures are negative measures like false positive rate (FPR), false negative rate (FNR), and false discovery rate (FDR).
References
1. Khan A, Uddin S, Srinivasan U (2019) Chronic disease prediction using administrative data
and graph theory: the case of type 2 diabetes. Expert Syst Appl 136:230–241
2. Mohan S, Thirumalai C, Srivastava G (2019) Effective heart disease prediction using hybrid
machine learning techniques. IEEE Access 7:81542–81554
3. Aliberti A, Pupillo I, Terna S, Macii E, Cataldo DS, Acquaviva PEA (2019) A multi-patient
data-driven approach to blood glucose prediction. IEEE Access 7:69311–69325
4. Almansour NA, Syed HF, Khayat NR, Altheeb RK, Juri RE, Alhiyafi J, Alrashed S, Olatunji
SO (2019) Neural network and support vector machine for the prediction of chronic kidney
disease: a comparative study. Comput Biol Med 109:101–111
5. Zhao Y, Ma B, Jiang P, Zeng D, Wang X, Li S (2021) Prediction of Alzheimer’s disease
progression with multi-information generative adversarial network. IEEE J Biomed Health Inf
25(3):711–719
6. Wang T, Tian Y, Qiu RG (2020) Long short-term memory recurrent neural networks for multiple
diseases risk prediction by leveraging longitudinal medical records. IEEE J Biomed Health Inf
24(8):2337–2346
7. Brand L, Nichols K, Wang H, Shen L, Huang H (2020) Joint multi-modal longitudinal
regression and classification for Alzheimer’s disease prediction. IEEE Trans Med Imaging
39(6):1845–1855
8. Kumar M, Kumar A, Palaparthy VS (2021) Soil sensors-based prediction system for plant
diseases using exploratory data analysis and machine learning. IEEE Sens J 21(16):17455–
17468
9. Wu C, Gao R, Zhang Y (2019) mHMDA: human microbe-disease association prediction by
matrix completion and multi-source information. IEEE Access 7:106687–106693
10. Wang H, Huang Z, Zhang D, Arief J, Lyu T, Tian J (2020) Integrating co-clustering and
interpretable machine learning for the prediction of intravenous immunoglobulin resistance in
Kawasaki disease. IEEE Access 8:97064–97071
11. Dong Y, Sun Y, Qin C, Zhu W (2020) EPMDA: edge perturbation based method for miRNA-
disease association prediction. IEEE/ACM Trans Comput Biol Bioinform 17(6):2170–2175
12. Ge R, Zhang R, Wang P (2020) Prediction of chronic diseases with multi-label neural network.
IEEE Access 8:138210–138216
13. Liu P, Luo J, Chen X. miRCom: tensor completion integrating multi-view information to deduce
the potential disease-related miRNA-miRNA pairs. IEEE/ACM Trans Comput Biol Bioinform
14. Hashem S, Esmat G, Elakel W, Habashy S, Raouf SA, Elhefnawi M, Eladawy M, Elhefnawi M
(2018) Comparison of machine learning approaches for prediction of advanced liver fibrosis
in chronic hepatitis C patients. IEEE/ACM Trans Comput Biol Bioinform 15(3):861–868
15. Zhang J (2021) Multi-resemblance multi-target low-rank coding for prediction of cognitive
decline with longitudinal brain images. IEEE Trans Med Imaging 40(8):2030–2041
16. Zhang, Y, Ye F, Gao X. MCA-net: multi-feature coding and attention convolutional neural
network for predicting lncRNA-disease association. IEEE/ACM Trans Comput Biol Bioinform
17. Jiang P, Wang X, Li Q, Jin L, Li S (2019) Correlation-aware sparse and low-rank constrained
multi-task learning for longitudinal analysis of Alzheimer’s disease. IEEE J Biomed Health
Inform 23(4):1450–1456
18. Yang K. PDGNet: predicting disease genes using a deep neural network with multi-view
features. IEEE/ACM Trans Comput Biol Bioinform
19. Xuan P, Gao L, Sheng N, Zhang T, Nakaguchi T (2021) Graph convolutional autoencoder
and fully-connected autoencoder with attention mechanism based method for predicting drug-
disease associations. IEEE J Biomed Health Inform 25(5):1793–1804
20. Zhang Z, Ding J, Xu J, Tang J, Guo F (2021) Multi-scale time-series kernel-based learning
method for brain disease diagnosis. IEEE J Biomed Health Inform 25(1):209–217
21. Xuan P, Zhan L, Cui H, Zhang T, Nakaguchi T, Zhang W. Graph triple-attention network for
disease-related LncRNA prediction. IEEE J Biomed Health Inform
22. Peng W, Liu M, Dai W, Chen T, Fu Y, Pan Y. Multi-view feature aggregation for predicting
microbe-disease association. IEEE/ACM Trans
23. Dayun L, Junyi L, Yi L, Qihua H, Deng L. MGATMDA: predicting microbe-disease
associations via multi-component graph attention network. IEEE/ACM Trans Comput Biol
Bioinf
24. Song Y, Cui H, Zhang T, Yang T, Li X, Xuan P. Prediction of drug-related diseases through
integrating pairwise attributes and neighbor topological structures. IEEE/ACM Trans Comput
Biol Bioinforma
25. Prince J, Andreotti F, De Vos M (2019) Multi-source ensemble learning for the remote predic-
tion of Parkinson’s disease in the presence of source-wise missing data. IEEE Trans Biomed
Eng 66(5):1402–1411
26. Wu QW, Cao RF, Xia J, Ni JC, Zheng CH, Su Y. Extra trees method for predicting LncRNA-
disease association based on multi-layer graph embedding aggregation. IEEE/ACM Trans
Comput Biol Bioinform
27. Han G, Kuang Z, Deng L. MSCNE: Predict miRNA-disease associations using neural network
based on multi-source biological information. IEEE/ACM Trans Comput Biol Bioinform
28. Shi J (2019) Cascaded Multi-column RVFL+ classifier for single-modal neuroimaging-based
diagnosis of Parkinson’s disease. IEEE Trans Biomed Eng 66(8):2362–2371
29. Usama M, Ahmad B, Xiao W, Hossain MS, Muhammad G (2019) Self-attention based recurrent
convolutional neural network for disease prediction using healthcare data. Comput Methods
Programs Biomed 190:11–11
30. Jiang P, Wang X, Li Q, Li JLS (2019) Correlation-aware sparse and low-rank constrained
multi-task learning for longitudinal analysis of Alzheimer’s Disease. IEEE J Biomed Health
Inform 23(4):1450–1456
31. Lei B, Yang M, Yang P, Zhou F, Hou W, Zou W, Li X, Wang T, Xiao X, Wang S (2020) Deep
and joint learning of longitudinal data for Alzheimer’s disease prediction. Pattern Recognit
102:107247–107247
32. Hashim FA, Houssein EH, Hussain K, Mabrouk MS, Al-Atabany W (2022) Honey Badger algorithm: new metaheuristic algorithm for solving optimization problems. Math Comput Simul 192
A Hybrid Multi-user Based Data
Replication and Access Control
Mechanism for Cloud Data Security
V. Devi Satya Sri and Srikanth Vemuru
Abstract A data replication-based cloud data access control mechanism plays a major role in real-time cloud computing environments due to high computational cost and memory. Hybrid data replication models play a vital role in cloud-based applications for data recovery and security. Machine learning tools and techniques play an essential role in the medical field and in cloud computing applications. Most traditional machine learning models use static data partitioning and replication methods in order to recover patterns from multiple virtual machines in the cloud computing environment. In this work, a hybrid data replication and multi-user data access mechanism is developed to provide strong data recovery and security in the cloud computing environment, and machine-learning-based patterns are used for data partitioning and security on the cloud server. Experimental results show that the hybrid data replication model achieves better data replication time and storage space than the conventional data replication models in the cloud computing environment.
Keywords Data replication ·Data partitioning ·Cloud computing ·Data access ·
Medical patterns
1 Introduction
The access speed of the data should be increased to keep the load in the system balanced. The two main factors for improving cloud performance are scalability and availability. Replication creates multiple copies of an existing entity [1], and the creation of replicas is one of the important approaches for achieving this. Replication enhances resource availability, and it also offers minimum access costs, shared bandwidth usage, and reduced time delays. In case of system failure, the value of replication is transparent, flawless access to resources. Replication across a computer network can be extended to allow storage devices to be
V. Devi Satya Sri · S. Vemuru (B)
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
Vaddeswaram, Guntur District, A.P, India
e-mail: vsrikanth@kluniversity.in
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_9
located in physically separated installations. In the event of a failure, to ensure data transmission, users access nearby replicas, which increases the throughput. The data is saved at more than one site, with advantages: a system can operate using replicated data when a server holding the required data fails, so the data remains accessible. Data are saved at several locations, and the requested data is collected from the site nearest the source of the request, which increases the system's performance. Replication benefits are not provided without the overheads of replicas being created, maintained, and updated. Replication can improve performance significantly [2]. Replication technology takes time to recover data from other sites and restart the service, so there is a performance overhead, but it is advantageous for tolerating faults and increasing accessibility [3]. One of the problems in cloud computing systems is the vulnerability to failures; in fact, the availability of the entire system could be compromised whenever a single node crashes [4]. However, their distribution provides the means of enhancing the system's reliability. A fault-tolerant system is a configuration that prevents an unintended problem from occurring in a computer system. Fault tolerance means that, even with failures that cause errors in the results within the system, the provision of expected services is preserved [5].
Most decisions are made somewhere in the middle, and human decision-makers can be supported and enhanced by the use of a decision support system in these cases [6]. In the event of replication or migration of common data blocks at arbitrary chip sites, directory or broadcast mechanisms are used to search and ensure consistency, because the placement requirements of each block are likely to differ [7]. In many cases, the usability of a storage system depends on its scalability. When very many data items are to be stored, or the number of store requests exceeds what autonomous systems can handle, the logical architectural choice is for the data to be distributed across multiple physical computers. Replication should be used [8] if comparatively few data items serve many requests.
It is desirable to replicate the stock and customer data at these locations since it provides quick access to local replicas and supports disaster survival in instances where all machinery at a physical location crashes [9]. Initially, the Personalized Search team built a client-side replication mechanism on top of Bigtable to ensure that all replicas were eventually consistent; a replication subsystem that is integrated into the servers is now used in the current system [10]. Data-access-pattern replication strategies have also been developed [11]. Cascading replication worked well to reduce access latency, and rapid replication worked well when the main objective was a reduction in bandwidth consumption, but it also wastes a great deal of storage [12].
The study [13] proposed a dynamic model replication strategy. The architecture used is peer-to-peer, and in this strategy the replication decision is taken in a decentralized way. This strategy is limited by the fact that it assumes replicas can be created without limitation, which is not possible in practice. Another overhead is the need to invoke the replica location service every time a replica is created [14]. Of these concerns, we concentrate on data loss and cloud power consumption. Hardware failures are more likely than trivial because of the large number of cloud computing nodes, as the statistical analysis of hardware failures in [15] shows. Some
hardware failures can damage disk node data; thus, running data-intensive applications cannot successfully read data from the disks.
2 Related Works
The study [16] proposed a replication technique based on cloud metrics using network bandwidth, known as replication based on the Bandwidth Hierarchy. This replication strategy uses network-level data access information and is aimed mainly at reducing data access time by preventing network congestion. The site is divided into various areas; one step is to prevent data duplication, and the other is to replicate only popular files [17]. Gudeme et al. [18] offer the branch replication scheme (BRS), whose architecture is inherently hierarchical. The data replication system has an important role to play in the management of grid data [19, 20]. Traditional data replication systems require very large storage, whereas BRS stresses optimal consumption of storage. Contrary to standard replication, which stores the entire replica at each site, BRS stores only a subsection of the replica, and parallel access to that subsection is also possible, which enhances data access performance [21–23]. Simulation results demonstrate that BRS offers better data access performance and scalability than other systems such as hierarchical replication systems and server-directed data replication schemes. For read and write operations, the branch replication scheme is better than the hierarchical replication scheme for all file sizes [24]. A dynamic replication strategy was proposed to place replicas in hierarchical data; this scheme is based on file popularity and is named the PBRP strategy.
The most commonly accessed file is identified by a higher weight value and replicated to appropriate locations for load balancing purposes. It is demonstrated that LALW runs with network efficiency similar to LFU optimizers. Yogendra Naidu et al. [25] proposed an enhanced LALW strategy for the dynamic data replication concept in data grids, known as enhanced last access largest weight (ELALW). The study [26] explores cost-performance compromises between replicated storage systems and erasure-encoded ones. The work in [27] analyzed the data replica placement mechanism and developed a heuristic algorithm for the placement of data replicas; the simulation assesses whether or not the algorithm performs better in a storage environment. A distributed, replicated, transparent, and dynamic provable data possession (DPDP) scheme for customers has been developed by Alshammari et al. [28]. It enables the cloud storage provider (CSP) to conceal the internal structure from its customers and manage resources with flexibility while still providing the customer with provable services. This work also uses persistent ranked authenticated skip lists to create a dynamic version control system with optimal complexity that is both centralized and distributed.
3 Proposed Model
Data replication happens when the same data is saved on several storage devices, or when the same computing work is carried out repeatedly. It is the process of automatically distributing copies of data and database objects among SQL Server instances and maintaining them in synchronization. Replication is the process of information sharing to improve reliability, fault tolerance, and accessibility among redundant resources, such as software or hardware components. Secure information sharing is a difficult problem in this type of environment. There are two main data replication protocols: active replication, in which all the replica processes concurrently handle every input message, and passive replication, in which only one replica processes all input messages and regularly transfers its current state to the other replicas for consistency. The owners of different sources have different policies regarding access to and dissemination of the data they hold, and the database research community has focused on passive replication. Data distribution and replication offer opportunities to improve performance by running and loading parallel queries and by increasing data availability; data is often replicated in a distributed database system to increase reliability and accessibility. As shown in Fig. 1, different virtual machines are initially assigned to each user. Since each user has k virtual machines, each user's data is partitioned using the data partitioning algorithm, and each data part is passed through the integrity model before being stored in the VM.
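As a minimal illustration of the passive replication idea described above (not the proposed mechanism itself), a single primary can process every input and periodically push its state to the backups:

# Minimal sketch of passive replication: the primary processes all input
# messages and periodically transfers its current state to the backups.
class PassiveReplicaGroup:
    def __init__(self, n_backups):
        self.state = {}                        # primary's authoritative state
        self.backups = [{} for _ in range(n_backups)]

    def process(self, key, value):
        self.state[key] = value                # only the primary handles input

    def sync(self):
        for b in self.backups:                 # periodic state transfer
            b.clear()
            b.update(self.state)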
Fig. 1 Proposed multiple data partitioning and load balancing parameter selection: each user's (U-1 … U-n) data is partitioned by the data partitioning algorithm, passed through the cloud data security framework, and stored across virtual machines VM_1 … VM_n on Amazon AWS
Blockwise Data Replication Algorithm
Input: cloud user data files
Output: data files with user access policies
Procedure:
1. For each file in the cloud user data files
2. Partition the data into k blocks
3. For each block B(i) of the k blocks, each with 1024 bits
4. Do
5. Let VM_ID be the cloud virtual machine ID with available data zones η
6. Compute the user's access policy using Algorithm 2 as U_P(VM_ID, B(i))
7. Compute each user's secret nonce using the cyclic group parameters:
Let Zr, G1, G2 be randomized cyclic group parameters with generator a.
GauDist(a) = k(1 − k)^a, a = 0, 1, 2, ...
UniDist(a) = 1/(r2 − r1) for r1 < a < r2
CyclicElement p = bilinearpair(Zr, GauDist(a))
PrtKey.g = bilinearpair(G1, σ² · UniDist(a))
PubKey.gp = bilinearpair(G2, a)
MastKey.p = bilinearpair(G2, a)
MastKey.g_alpha = bilinearpair(PubKey.gp, (p)Zr)
PubKey.h = bilinearpair(PubKey.g, (p)Zr)
PubKey.g_halpha = bilinearpair(PubKey.g, MastKey.g_alpha)
8. Save each block in η(VM_ID, B(i), U_P) using the user's access policy
9. Replicate the block to each VM in the VMList
10. Done
In this algorithm, each user's data file is partitioned, and each resulting block is replicated to different virtual machines for recovery purposes. In steps 1–3, each cloud user's data file is taken as input and partitioned into blocks of 1024 bits. In steps 4–7, each of the k blocks is used to compute the user's access policy and the user's secret nonce using the cyclic group metrics. In step 8, each block is stored using the zone list with three parameters: the virtual machine ID, the block data, and the user's access policy. Finally, in step 9, each block is replicated to the available virtual machines, as sketched below.
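A minimal sketch of steps 1–3 and 9 of the algorithm (partitioning into 1024-bit blocks and replicating each block to every VM) is given below; store_on_vm is an assumed storage primitive, and the cryptographic access-policy computation of steps 5–7 is omitted.

# Minimal sketch: split a file into 1024-bit (128-byte) blocks and replicate
# each block to every VM in a hypothetical VM list.
BLOCK_BITS = 1024

def partition(data, block_bits=BLOCK_BITS):
    size = block_bits // 8
    return [data[i:i + size] for i in range(0, len(data), size)]

def replicate(data, vm_list, store_on_vm):
    for idx, block in enumerate(partition(data)):
        for vm_id in vm_list:
            store_on_vm(vm_id, idx, block)     # a copy of the block on each VM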
Algorithm 2: User Access Policy Generator
1: Initialize secret key K.
2: Partition the input data M into blocks of size 8.
3: while (len(M) > 0) do
if (len(M) < 8)
pad the message with the sequence 0, 0, ..., 0, 1;
else
perform block processing on each block partition;
done
4: Block processing:
divide the block into 32-bit sub-blocks for the non-linear transformation in the proposed model;
SP[] = BlockPartition[S/32];
for i = 0 to len(SP) do
while (r < NR) do // r: current round
perform SubblockProcessing(SP[i])
done
done
5: Sub-block processing: for each byte in SP[i] do
mat_y = |N| · e^(−|ΣK − μ|/ρ) / (2ρ), ρ > 0
η = Norm(mat_y)
gdf(η) = λ^α x^(α−1) e^(−λx) / Γ(α), for x > 0
h1 = sp[i]
h2 = f(sp[i]) = log( λ e^(−λ(sp[i]−τ)) / (1 + e^(−λ(sp[i]−τ)))² · gdf(η).mean )
h3 = bytes(mat_y)
H[i] = h1 ⊕ h2 ⊕ h3
done
6: H = Concat(H0 || H1 || H2 || ... || Hn)
In Algorithm 2, each user's access control policy is updated using the policy generator. A secret key is initialized and the input data is partitioned to find the block-wise access control, as shown in steps 1–3. In step 4, each data block is sub-partitioned, and in step 5 each sub-block partition is used to compute the user's access policy through the hash values, as illustrated by the sketch below.
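The sketch below is a simplified, hash-based stand-in for this generator: it keeps the 8-byte block, 32-bit sub-block, and concatenation structure of Algorithm 2 but replaces the non-linear transformations with SHA-256.

# Simplified access-policy tag: pad to 8-byte blocks (0x00 ... 0x01), hash each
# 32-bit sub-block with the secret key, and concatenate the digests.
import hashlib

def access_policy(secret_key, message):
    rem = len(message) % 8
    if rem:
        message += b'\x00' * (8 - rem - 1) + b'\x01'   # step 3 padding
    tags = []
    for i in range(0, len(message), 8):                # 8-byte blocks
        block = message[i:i + 8]
        for j in range(0, 8, 4):                       # 32-bit sub-blocks
            sub = block[j:j + 4]
            tags.append(hashlib.sha256(secret_key + sub).digest())
    return b''.join(tags)                              # H = H0 || H1 || ... || Hn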
Fig. 2 Performance analysis of the proposed data replication model against the conventional models on cloud storage data (VM-2): response time (sec) versus data partitions for PDDRA, dynamic weighted (DWDR), linearDRA, and the proposed model
Fig. 3 Performance analysis of the proposed data replication model's storage space against the conventional models on cloud storage data (VM-1 and VM-2): storage space versus data partitions for the same four models
4 Experimental Results
Experimental results are obtained in a cloud computing environment with users' datasets. In this study, an Amazon AWS cloud server is used to evaluate the block-wise replication process across the available cloud virtual machines. In these experiments, each user's machine learning patterns are used as input data for the replication process; these patterns are derived from medical databases using filter-based classification models (Figs. 2 and 3; Tables 1, 2 and 3).
5 Conclusion
In this work, a hybrid data replication-based multi-user access control mechanism is designed and implemented on cloud servers. Most conventional models find it difficult to recover users' machine learning patterns due to high computational cost and memory constraints. In this work, a hybrid data-partitioning-based replication model is implemented on users' machine learning patterns for data recovery and the decision-making process.
Table 1 Performance analysis of proposed data replication model to the conventional models on
cloud storage data (VM-1)
DataSize PDDRA Dynamic weighted (DWDR) linearDRA Proposed
USER-1
DataSize-1 KB 3.96 4.42 3.33 2.3
DataSize-2 KB 4.65 4.65 3.35 2.35
DataSize-3 KB 4.32 4.54 3.18 2.39
DataSize-4 KB 4.13 3.8 3.28 2.36
DataSize-5 KB 4.73 4.4 3.22 2.33
DataSize-6 KB 4.33 4.32 3.38 2.34
DataSize-7 KB 4.53 3.87 3.15 2.25
DataSize-8 KB 4.2 4.44 3.13 2.31
DataSize-9 KB 3.95 3.85 3.38 2.25
DataSize-10 KB 3.83 4.52 3.32 2.36
Table 2 Performance analysis of proposed data replication model to the conventional models on
cloud storage data (average of all VMs)
DataSize PDDRA Dynamic weighted (DWDR) linearDRA Proposed
DataSize-1 KB 4.23 4.26 3.29 2.39
DataSize-2 KB 4.13 4.6 3.43 2.27
DataSize-3 KB 4.52 4.55 3.18 2.36
DataSize-4 KB 4.61 4.32 3.16 2.36
DataSize-5 KB 4.12 4.73 3.17 2.31
DataSize-6 KB 4.83 3.96 3.28 2.35
DataSize-7 KB 4.34 4.26 3.15 2.34
DataSize-8 KB 4.83 4.42 3.18 2.33
DataSize-9 KB 4.22 3.98 3.15 2.4
DataSize-10 KB 4.09 4.67 3.16 2.26
Table 3 Performance analysis of proposed data replication model storage space to the conventional
models on cloud storage data (VM-1 and VM-2)
DataSize PDDRA Dynamic weighted (DWDR) linearDRA Proposed
VM-1
DataSize-1 KB 23.79 22.8 20.72 17.28
DataSize-2 KB 27.36 29.96 27.06 18.66
DataSize-3 KB 21.82 21.41 26.75 16.3
DataSize-4 KB 27.3 23.85 27.18 16.77
DataSize-5 KB 23.92 28.74 22.1 16.51
DataSize-6 KB 25.6 29.15 20.23 18.45
DataSize-7 KB 24.58 28.49 27.24 16.92
DataSize-8 KB 23.63 27.26 20.91 16.22
DataSize-9 KB 28.23 26.37 25.17 18.09
DataSize-10 KB 20.05 24.4 22.92 18.58
VM-2
DataSize-1 KB 24.15 29.73 28.49 17.25
DataSize-2 KB 26.66 27.36 20.55 17.22
DataSize-3 KB 26.83 24.72 23.05 16.47
DataSize-4 KB 27.04 20.17 27.05 18.17
DataSize-5 KB 27.07 23.1 29.02 17.66
DataSize-6 KB 24.57 24.58 21.78 16.46
DataSize-7 KB 21.63 22.99 22.87 16.45
DataSize-8 KB 28.86 20.3 21.07 18.61
DataSize-9 KB 21.88 20.44 26.36 16.05
DataSize-10 KB 22.28 22.88 29.9 18
Leveraging the Goldfinger Attack
in Blockchain Based on the Topological
Properties
Arcel Kalenga Muteba and Kingsley A. Ogudo
Abstract In this paper, we provide a new approach to modeling and analyzing
Goldfinger attacks in blockchain networks based on the topology of a peer-to-peer
network; we study the impact of the Goldfinger attack, well known as the 51% attack,
in the case of ring, mesh, and fully connected topologies. The simulation outcome
shows that in the fully connected topology, the attacker node with 501 (hash/s) found
155 blocks in 60 s while the rest of the nodes found, on average, 58 blocks per node;
this gives control of the balance to the blocks of node J, with a balance of 131 against
an average balance of 43 for the other nine nodes. In practice, the node with more
than 50% of the power will monopolize the balance. The Goldfinger attack was most
severe in the fully connected topology and monopolized the network because of its
topological connections: the attacker in a fully connected topology has a direct,
duplicated connection to the other nine nodes. In terms of duplicated bidirectional
or directional connections, the fully connected topology presents a hard fork
compared to the ring and mesh topologies after the attack.
Keywords P2P network · Bitcoin · Network topologies · 51% attack
1 Introduction
Blockchain-based applications are springing up, covering numerous fields, including
financial services, the Internet of Things (IoT), and so on. Cryptocurrency has sparked
extraordinary interest since it is a new type of currency and a disruptive and inventive
payment method. However, there are still many challenges in blockchain technology,
such as scalability and security problems, waiting to be overcome. On the other hand,
investors bear the risks associated with each transaction, as fraudsters utilize more
A. K. Muteba (B) · K. A. Ogudo
Department of Electrical and Electronics Engineering Technology, University of Johannesburg,
Johannesburg, South Africa
e-mail: arcelkaleng@gmail.com; 219120928@student.uj.ac.za
K. A. Ogudo
e-mail: kingsleyo@uj.ac.za
comprehensive techniques. This paper presents a comprehensive overview of the
Goldfinger attack on blockchain and the implementation and modeling of the attack
in a peer-to-peer network. It then analyzes the impact of the 51% attack in the ring,
mesh, and fully connected topologies based on the following metrics: blocks found,
maximum fork length of the network, and balance gain.
2 Study Review
In recent years, more research has focused on understanding cybercriminality
in cryptocurrencies. The following are some propositions to understand, model, and
simulate cyberattacks in cryptocurrencies. A first model for full-scale simulation
of the bitcoin peer-to-peer network at low runtimes was proposed in [1], together with
an examination of a network partitioning attack [2]. The validation results show that
key metrics from the simulated and real-world networks are very similar [1]. The
Bitcoin peer-to-peer network was robust to a partitioning attack involving fewer than
6000 bots over a period of a few hours; however, more resourceful attackers must
be taken into account. A possible attack on bitcoin's decentralized network design
was also studied: a data-driven analysis of bitcoin presented probable attacks
based on the network's spatial and temporal properties. That study shows that Bitcoin
is subject to spatial, temporal, and logical partitioning attacks, with enhanced attack
feasibility due to network dynamics, and backs up the findings by simulating attack
scenarios and their ramifications for bitcoin [3]. The authors of [4] proposed the
EREBUS attack, which splits the bitcoin network without any route modifications,
making it undetectable by control-plane and even data-plane detectors. EREBUS
transforms the adversary into a natural man-in-the-middle of all the peer connections
of one or more targeted bitcoin nodes by patiently influencing their peering
decisions [4].
3 Contribution of the Study
The primary contribution of this study is to implement and model the Goldfinger
attack, known as the 51% attack, in a peer-to-peer network. Secondly, it drafts and
analyzes the impact of the 51% attack in the ring, mesh, and fully connected topologies
based on the following metrics: blocks found, maximum fork length of the network,
and the gain of the balance.
4 Methodology and Design
This paper proposes a simulation-based analysis of the Goldfinger 51%
attack in a peer-to-peer network. To build this model, a blockchain simulator will be
used [5]. This software will be used to construct network topologies, simulate the
behavior of real nodes, and measure the results. Our simulator is based on the bitcoin
prototype. A behavioral analysis of three different network topologies will be drafted,
starting with the fully connected topology, followed by the mesh and ring topologies.
4.1 Peer-To-Peer Network Connectivity
Every node in a peer-to-peer architecture is directly linked to another node; every
computer node is referred to as a peer. Each peer both delivers and receives services from
other peers, and no central server is available [6]. A mesh topology has a point-to-point
link that connects each device to every other device on the network. The link is
exclusively used to carry data between the two connected devices. If the network
has n devices, each device must be connected to (n − 1) other devices, so a mesh
topology of n devices contains n(n − 1)/2 links [7, 8].

Each device in a ring topology is linked to the devices on either side of it: a device
features two dedicated point-to-point links, one on each side. If a device wishes to
communicate data to another device, it does so in one direction. Each device in a ring
topology has a repeater, which forwards data toward the intended device until it is received
[9]. Finally, a fully connected network is a mesh network that links all nodes, as
illustrated by the sketch below [10].
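As an illustration of these link counts (our own sketch, not part of the authors' simulator), the formulas above can be computed directly in Python:

```python
def ring_links(n: int) -> int:
    # Each device connects to its two neighbours; n nodes form n links.
    return n

def mesh_links(n: int) -> int:
    # A point-to-point link between every pair of devices: n(n - 1)/2.
    return n * (n - 1) // 2

def fully_connected_links(n: int) -> int:
    # A fully connected network is a mesh network linking all nodes.
    return mesh_links(n)

# For the 10-node network used later in this paper:
print(ring_links(10))             # 10
print(mesh_links(10))             # 45
print(fully_connected_links(10))  # 45
```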
5 Implementation
5.1 General Network Before the Goldfinger Attack
We deployed a network composed of 10 nodes with directional and bidirectional
communication from each node to the other nine nodes, using the alphabetical notation
A to J, represented with different colors for the ring and fully connected topologies.
The mesh topology, however, is denoted with random names. The following metric
parameters are defined for all the topologies (Table 1).
The three network topologies are displayed in Figs. 1, 2, and 3 in the following
section.
Table 1 Parameters of the network
Parameter Value
Power (Hash/s) 100 (Hash/s)
Latency 100 (ms)
Block size 1,000,000
Difficulty 0.01
Downlink 10–100 (MBps)
Uplink 0.5–10 (MBps)
Fig. 1 Fully connected topology
Fig. 2 Ring topology
Fig. 3 Mesh topology
Fig. 4 Comparison of
topologies before attack
5.2 Results of General Network Before the Goldfinger Attack
The analysis and discussion for this implementation are based upon the following
metrics: the total blocks found, the balance, and the maximum fork length. As
highlighted in Fig. 4, before the attack occurred in the network, over a period of 60 s
the mesh topology found 682 blocks, followed by the fully connected topology with
646 and the ring topology with 610 blocks; topologically, in a peer-to-peer network,
the mesh topology can broadcast more blocks.

In practice, bitcoin uses a mesh topology with random connections. The fully
connected topology presents a total balance of 504 in 60 s, followed by mesh with
408 and ring with 379. Regarding the maximum fork length over a period of 60 s,
the disagreement within the blocks over speed, block size, and transaction fees
indicated that the ring topology presents a soft maximum fork length of 811, probably
double that of the fully connected topology.
5.3 Implementation of the Goldfinger Attack
Simulating the Goldfinger attack in the network depends directly on the hash rate
and the power of the nodes. We therefore powered the last node with 501 (Hash/s),
representing more than 50% of the power in the network; this is node J in the ring
and fully connected topologies and Austin in the mesh topology. The rest of the
nodes were given 100 (Hash/s), the minimum downlink was set to 10 MBps and the
maximum to 100 MBps, and the uplink was set to 0.5 MBps for the minimum and
10 MBps for the maximum. The difficulty of adding a new block to the chain is 0.01,
and the latency is set to 100 ms, as shown in Table 2.
Table 2 Parameters of the attack
Parameter Value
Power (Hash/s) 100 (Hash/s)
Attacker power (Hash/s) 501 (Hash/s)
Latency 100 (ms)
Block size 1,000,000
Difficulty 0.01
Downlink 10–100 (MBps)
Uplink 0.5–10 (MBps)
6 Results
The results are shown in Fig. 5 for the fully connected topology, Fig. 6 for the ring
topology, and Fig. 7 for the mesh topology; the comparison of the three topologies
is in Fig. 8.
Fig. 5 Attack on full
connected
Fig. 6 Attack on ring
Fig. 7 Attack on mesh
Fig. 8 Comparison after
attack
7 Discussion and Analysis
In the case of the fully connected topology, the attacker node with 501 (hash/s) found
155 blocks in 60 s and the rest of the nodes, on average, 58 per node, which gives
control of the balance to the blocks of node J, with a balance of 131 against an average
of 43 for the other nine nodes, as indicated in Figs. 5 and 8. In the ring and mesh
topologies, we obtained balances of 115 and 121, respectively, for the attacker and,
on average, 35 and 39 for the rest of the nodes, as shown in Figs. 6 and 8.

The Goldfinger attack was most severe in the fully connected topology and monopolized
the network because, topologically, the attacker in the fully connected topology
has a direct, duplicated connection to the other nine nodes. In terms of duplicated
bidirectional or directional connections, the fully connected topology presents a hard
fork compared to the ring and mesh topologies after the attack. The maximum fork length
over a period of 60 s, the disagreement within the blocks over speed, block size, and
transaction fees, occurred 43 times for the fully connected topology, 56 times in the
ring, and 53 in the mesh topology.
8 Conclusion
In this work, we implemented and modeled the Goldfinger attack, known as
the 51% attack, in a peer-to-peer network. Secondly, we drafted and analyzed the
impact of the 51% attack in the ring, mesh, and fully connected topologies based on
the following metrics: blocks found, maximum fork length of the network, and the
gain of the balance. The model was successfully implemented using a blockchain
simulator; the simulation results show that in the case of the fully connected
topology, the attacker node with 501 (hash/s) found 155 blocks in 60 s, while the rest
of the nodes found 58 blocks on average per node, giving control of the balance to the
blocks of node J, with a balance of 131 against an average of 43 for the other nine
nodes. In practice, the node possessing more than 50% of the balance will dominate it.
The attacker in the fully connected topology has a direct, duplicated connection to the
other nine nodes, and the Goldfinger attack was most damaging in the fully connected
topology, monopolizing the network because of the topological connections. For
further work, understanding all the metrics of the peer-to-peer network will be
considered for implementing the same attack and assessing its impacts.
References
1. Kim TW, Zetlin-Jones A (2019) The ethics of contentious hard forks in blockchain networks
with fixed features. Front Blockchain:9
2. Banerjee S, Das D, Biswas M, Biswas U (2020) Study and survey on blockchain privacy
and security issues. In: Cross-industry use of blockchain technology and opportunities for the
future. IGI Global, pp 80–102
3. Neudecker T, Andelfinger P, Hartenstein H (2015) A simulation model for analysis of attacks
on the Bitcoin peer-to-peer network. In: The 2015 IFIP/IEEE international symposium on
integrated network management (IM). IEEE, pp 1327–1332
4. Fan W, Chang SY, Zhou X, Xu S (2021) Conman: a connection manipulation-based attack
against bitcoin networking. In: 2021 IEEE conference on communications and network security
(CNS). IEEE, pp 101–109
5. https://simewu.com/blockchainsimulator/index.html
6. Fox G (2001) Peer-to-peer networks. Comput Sci Eng 3(3):75–77
7. Bisht N, Singh S (2015) Analytical study of different network topologies. Int Res J Eng Technol
(IRJET) 2(01):88–90
8. Thai CDT, Lee J, Prakash J, Quek TQ (2018) Secret group-key generation at the physical layer
for multi-antenna mesh topology. IEEE Trans Inf Forensics Secur 14(1):18–33
9. Puelles L, Alonso A, García-Calero E, Martínez-de-la-Torre M (2019) Concentric ring
topology of mammalian cortical sectors and relevance for patterning studies. J Comp Neurol
527(10):1731–1752
10. Tsujimoto T, Shindo T, Kimura T, Jin’no K (2012) A relationship between network topology
and search performance of PSO. In: 2012 IEEE congress on evolutionary computation. IEEE,
pp 1–6
Bitcoin Transaction Computational
Efficiency and Network Node Power
Consumption Prediction Using
an Artificial Neural Network
Arcel Kalenga Muteba, Kingsley A. Ogudo, and Espoir M. M. Bondo
Abstract This paper develops and discusses the predictability of the power consumption
of the Bitcoin network using an Artificial Neural Network (ANN) machine
learning algorithm, addressing the computational problem of the Bitcoin mining
process and its impact on energy consumption in the crypto mining process.
In this paper, we used datasets of Bitcoin historical information for the training and
testing of the ANN algorithm. The data filtration process was done with the help of
Python libraries; Python provides excellent features for data analysis and visualization.
After understanding the data, we trimmed it and used the characteristics or
attributes best suited for the model. The model was implemented, and the
results were recorded and analyzed. The results obtained demonstrate that an
Artificial Neural Network (ANN) machine learning algorithm can approximately
predict the actual electricity consumption of Bitcoin with high accuracy.
Keywords Bitcoin · Artificial Neural Network · Power consumption
1 Introduction
Among the leading technologies of modern times, blockchain uses a large amount
of electricity [1, 2]. In its early days, Bitcoin was mined on central processing units
(CPUs) in laptops and desktop computers, and early miners could obtain Bitcoin with
low-cost hardware and a personal computer. The growing interest in Bitcoin mining
A. K. Muteba (B) · K. A. Ogudo
Department of Electrical and Electronics Engineering Technology, University of Johannesburg,
Johannesburg, South Africa
e-mail: 219120928@student.uj.ac.za; arcelkaleng@gmail.com
K. A. Ogudo
e-mail: kingsleyo@uj.ac.za
E. M. M. Bondo
Engineering Research and Development BOND’AF, Paris, France
e-mail: espoirbondo@bondaf.com
Fig. 1 Bitcoin energy
consumption
has led miners to discover that graphics cards could implement the mining algorithms
more efficiently and help add cryptocurrency. Graphics cards were later replaced by
reconfigurable logic devices called Field-Programmable Gate Arrays (FPGAs), whose
circuits users may adjust after manufacture, and then by Application-Specific
Integrated Circuits (ASICs). Bitcoin mining ASICs have changed over time; as mining
equipment has become more sophisticated, the mining industry has agreed to join
mining pools to spread profits and speed up transactions, which is very expensive for
many miners. The ASICs used for Bitcoin mining typically reside in temperature-
controlled data centers with easy access to electricity. The Cambridge Bitcoin
Electricity Consumption Index (CBECI) calculates a hypothetical range that includes a
lower-bound (floor) and upper-bound (ceiling) estimate. A best-guess estimate is
calculated within the parameters of this range to produce a more realistic value that
approximates Bitcoin's actual electricity consumption [3], as illustrated in Fig. 1,
which shows the annual Bitcoin energy consumption.
This paper presents the feasibility of predicting the energy use of the Bitcoin network
using an Artificial Neural Network (ANN). We will investigate how historical
data from the Bitcoin network can be used to solve the computational problem
of Bitcoin mining, and the impact of various types of synchronization techniques
on the power consumption of the crypto mining process.
2 Study Review
Evaluating the environmental impact of Bitcoin and other cryptocurrencies has
become the focus of many researchers. According to Mora, C., and Dittmar, L.,
energy-derived emissions from mining might drive global warming past 2 °C, and
this currency's annual energy use is expanding at an exponential rate, reaching a
staggering 55 TWh [4]. This is, without a doubt, a severe issue; in the first half of
2018, between 3 and 13 million metric tons of CO2 were released into the atmosphere
due to Bitcoin mining [5]. Unsettling information emerged in 2016: the annual energy
usage of Bitcoin mining was estimated to be 3.38 Terawatt-hours (TWh) [7]. This
enormous amount of energy equals Jamaica's whole annual energy consumption
in 2014, and the Bitcoin network's total energy consumption is greater than
Ireland's [6]. One report estimates that by the end of 2018, Bitcoin consumed
0.5% of the world's electricity. We know that the electricity requirement is due to
intricate computing, and as time passes, more complex problems must be solved
[6, 7].

Despite these technological advances, Bitcoin mining has not yet been transformed
into a highly integrated industry, as various Bitcoin mining pools continue
to compete using the Proof-of-Work (PoW) method [8–10]. If the use of Bitcoin
continues to rise, future generations may be forced to deal with profound implications.
The total amount of money in circulation worldwide is expected to be 11,000
billion U.S. dollars; as a result, the associated energy usage would exceed 4000 GW
[11]. This enormous energy demand is eight times France's electrical consumption
and double that of the U.S. [11]. As a result, Bitcoin may become a climatic burden.
3 Contribution of the Study
This paper first provides an understanding of the accurate calculation of the energy
consumption of cryptocurrency networks. Secondly, we develop the feasibility
of predicting the energy consumption of Bitcoin using Artificial Neural
Network (ANN) machine learning. We investigate how historical data from the
Bitcoin network can be used to solve the computational problem of Bitcoin mining,
and the impact of various types of synchronization techniques on the power
consumption of the crypto mining process.
3.1 Methodology
Our primary methodology is to predict the energy consumption of Bitcoin using
ANNs, but we first calculate the energy demand of Bitcoin and then collect the data;
transactions in the Bitcoin network are stored in the blockchain. We ran the Bitcoin
client on our local machine to get the latest blockchain. After collecting it, we parsed
it into blocks and transactions; each block has its own hash value. Once all data
collection is done, we divide the data into three parts: test data, training data, and
cross-validation data for machine learning statistics. The prediction uses a training
dataset, with 80% of the data allocated for training and 20% for testing. Daily
transactions from 1 August 2019 to 1 June 2021 are included in the data collection.
Although it is more difficult to forecast when there is increased volatility in the
Bitcoin price, ANN machine learning algorithms attempt to predict with some
degree of accuracy (Fig. 2) (Table 1).
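As a rough sketch of the split-and-train step described above (the file name and feature column names are our assumptions, not the authors' actual code), scikit-learn could be used as follows:

```python
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.neural_network import MLPRegressor
from sklearn.metrics import r2_score

# Hypothetical daily Bitcoin-network dataset; column names are assumptions.
df = pd.read_csv("bitcoin_daily.csv")
features = ["difficulty", "miner_fees", "hash_rate", "market_price"]
X, y = df[features], df["energy_consumption_twh"]

# 80% of the data for training and 20% for testing, as in the paper.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# Scale the inputs and fit a small feed-forward ANN regressor.
scaler = StandardScaler().fit(X_train)
ann = MLPRegressor(hidden_layer_sizes=(32, 16), max_iter=2000, random_state=0)
ann.fit(scaler.transform(X_train), y_train)

print("R^2 on test set:",
      r2_score(y_test, ann.predict(scaler.transform(X_test))))
```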
Fig. 2 Data flow
Table 1 Calculation parameters: parameters to consider during the calculation of energy consumption
Mining difficulty, daily
Miner fees, daily
Hash rate, daily
Bitcoin market price
Electricity cost
Power usage effectiveness
Average block time
Description of Bitcoin mining
Mining is the process that Bitcoin uses to create new Bitcoins by solving complex
math problems that verify transactions in the currency. The miner receives a reward
amount of Bitcoin when a block is successfully mined. Parameters that determine
the power consumption in cryptocurrency are hardware computing power, network
hash rate, thermal regulation for the hardware, and the mining difficulty. We can also
determine the maximum power requirement of the PoW blockchain mining process
by assuming that honest and rational miners, who mine only for profit, participate
in the mining process as long as the expected revenue from mining exceeds the
corresponding mining costs [1]. The total power consumption is provided by Eq. 1.
$$P_T = \frac{C_P \cdot (B_r + T_f)}{B_{av} \cdot E_p} \tag{1}$$
where $P_T$ is the total power consumption, $B_r$ the block reward, $T_f$ the transaction fees, $C_P$ the coin price, $B_{av}$ the average block time, and $E_p$ the minimum electricity price. However, the annual energy consumption triggered by the storage of a fully replicated blockchain can be estimated with Eq. 2.
$$E_{St}\left[\frac{\text{kWh}}{\text{year}}\right] = \#\text{Repl} \cdot BC_{St}[\text{GB}] \cdot EI_{St}\left[\frac{\text{kWh}}{\text{year}\cdot\text{GB}}\right] \tag{2}$$
where $E_{St}$ is the yearly energy for storing the blockchain, $BC_{St}$ is the size of the stored
blockchain, $EI_{St}$ is the average energy intensity of holding a unit of data (1 GB) for
one year, and $\#\text{Repl}$ is the average number of replicas. It is assumed that the
global average electricity price is 0.05 USD/kWh and remains constant [3].
Machine learning
Machine learning enables systems to make decisions independently, without human
intervention. These decisions are made once the machine can scan the data and
understand the underlying patterns; the outcome can then be segmented or anticipated
using pattern matching and subsequent analysis [12]. The field of machine learning
is separated into three categories: supervised, unsupervised, and reinforcement
learning. Because the proposed method is intended to estimate power consumption,
supervised learning is the better fit, since its primary function is to predict the value
of a target variable from predictor variables [13, 14].
4 Results and Discussion
We deployed the model to estimate and predict the monthly and cumulative electricity
consumption of Bitcoin from January 2019 to May 2021, as shown in Table 2.

Moreover, the results of the predicted annual power consumption over the same
period are presented in Table 3 and displayed in Fig. 3; we found a prediction
accuracy of A = 0.978 for the annual average power consumption of Bitcoin.
For the second case, the prediction accuracy for the annual low power consumption
of Bitcoin is A = 0.945, as shown in Fig. 4; finally, the prediction accuracy for the
yearly high (peak) power consumption of Bitcoin is A = 0.982, as highlighted in
Fig. 5.
Table 2 Cumulative and monthly consumption/TWh
Period Monthly consumption/TWh Cumulative consumption/TWh
January 2019 3.787 80.879
February 2019 2.905 82.891
March 2019 3.658 84.452
April 2021 9.127 235.312
May 2021 9.766 243.89
Table 3 Annual average power consumption
Period Annual power consumption of Bitcoin/TWh Accuracy of ANN
Average 129.21829335 0.978
Low 42.214 0.945
High 482.8749232 0.982
Fig. 3 The annual average power consumption of Bitcoin
Fig. 4 The annual low
power consumption of
Bitcoin
The prediction accuracy drops as the number of feature prediction parameters
increases. The first direct correlation exists between the mining difficulty and
the energy consumption. As Ethereum has switched from Proof of Work to Proof of
Stake, there is a pressing need for Bitcoin to reduce mining difficulty and adapt to a
Fig. 5 The annual peak power consumption of Bitcoin
different consensus model. Despite its complexity and training time limits, the ANN
model provides excellent accuracy, as evidenced by all graphs. Future work might
explore other models, for instance Deep Neural Networks and Convolutional Neural
Networks, to see whether the change affects model accuracy.
5 Conclusion and Future Work
This paper focuses on a predictability approach for Bitcoin energy consumption
using Artificial Neural Networks (ANN). We discovered that using the ANN
machine learning technique, we can estimate the actual electricity usage of Bitcoin
with a high degree of accuracy. However, determining the exact energy consumption
of a large open distributed network is a complex task, because the precise number
of participants, the properties of their hardware, and the effort they put into mining
are unknown. This paper only compares the ANN predictions with the approximated
values of the CBECI. In the future, further machine learning models will be compared
to confirm the result. Another approach that could be experimented with is to predict
the energy consumption of various other cryptocurrencies.
References
1. Sedlmeir J, Buhl HU, Fridgen G, Keller R (2020) The energy consumption of blockchain technology: beyond myth. Bus Inf Syst Eng 62:599–608
2. Peck ME (2013) The bitcoin arms race is on. IEEE Spectrum 50(6)
3. Cambridge Bitcoin Electricity Consumption Index (CBECI). https://ccaf.io/cbeci/index
4. Mora C, Rollins RL, Talalay K, Kantar MB, Chock MK, Shimada M, Franklin EC (2018) Bitcoin emissions alone could push global warming above 2 °C. Nat Clim Change
5. Dittmar L, Praktiknjo A (2019) Could bitcoin emissions push global warming above 2 °C? Nat Clim Change
6. Hern A (2017) Bitcoin mining consumes more electricity a year than Ireland
7. Huckle S (2018) Bitcoin's energy consumption is a concern, but it may be a price worth paying
8. Allied Control: Analysis of large-scale bitcoin mining operations (White Paper)
9. Brito J, Castillo A (2013) Bitcoin: a primer for policymakers
10. Shalev-Shwartz S, Ben-David S (2014) Understanding machine learning: from theory to algorithms
11. Flipo F: The bitcoin and blockchain: energy hogs
12. Louridas P, Ebert C (2016) Machine learning. IEEE Software 33(5):110–115
13. Tonge V, Buradkar MM (2020) Introduction to machine learning and its applications: a survey. J Artif Intell Mach Learn Soft Comput
14. Ongsulee P (2017) Artificial intelligence, machine learning, and deep learning. In: Fifteenth international conference on ICT and knowledge engineering, Thailand
Remote Breast Cancer Patient
Monitoring System: An Extensive Review
Sangeeta Parshionikar and Debnath Bhattacharyya
Abstract The healthcare domain is one of the fastest-growing fields for the Internet
of Things and Artificial Intelligence. The advancement of medical resources is
insufficient to meet the needs of remote patient monitoring and treatment, an issue
that is growing increasingly prevalent in developing countries. The convergence of
IoT and AI solves this problem significantly. A remote monitoring system for breast
cancer patients is urgently needed in order to provide effective care to them. This
study examines related research on existing and future technologies for breast cancer
detection and shows how the confluence of IoT and AI is leading to the emergence of
smart healthcare. Various breast cancer screening approaches are briefly addressed,
as well as popular public databases. Following that, issues in remote monitoring
systems are discussed. We also present a case study on a remote monitoring system
for breast cancer patients to provide an enhanced solution for women in rural areas.
Keywords Healthcare · Breast cancer · Remote patient monitoring · Hospital management · IoT · Wearables · Artificial Intelligence
1 Introduction
The healthcare sector is moving toward automation. From surgery to hospital
management and from manual diagnosis to automated diagnosis using CAD tools,
Artificial Intelligence and Internet of Things (IoT) technologies are playing a key
role in the growth of smart healthcare. By incorporating these technologies, the
performance of various domains of medical application, such as diagnosis, prognosis,
monitoring, spread control, and assistive systems, is improving, and research is still
ongoing [1, 2]. Early diagnosis and detection of life-threatening diseases such as
cancer, heart disease, or diabetes is becoming possible with IoT and deep learning
(DL), a subset of AI.
S. Parshionikar (B) · D. Bhattacharyya
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
KLEF, Guntur, Andhra Pradesh, India
e-mail: sangeeta05@gmail.com
Fig. 1 The components of smart healthcare: smart hospital, remote patient monitoring, on-body sensors, emergency response, mobile health, doctor/nurse/technician, and telemedicine
Conventional cancer therapies such as surgery, chemotherapy, and radiotherapy
will remain in use for many years to come, but AI is bringing a novel and powerful
tool to the fight against cancer [3] and will revolutionize cancer treatment. Still,
there are a few limitations at different levels in the healthcare sector when handling
AI-based tools, including patient data confidentiality, dataset size, classification
across many types of cancer, unregulated training sets, etc. Smart healthcare is
becoming an emerging research subject as the globe evolves toward remote monitoring
and concurrent, real-time detection of most diseases. Smart healthcare is divided into
several categories, such as telehealth, mobile health, and RPM, all of which refer to
the use of IoT to monitor remotely located patients [4]. The components involved in
smart healthcare can be seen in Fig. 1.
2 Related Work on Breast Cancer
By 2030, breast cancer will be the major cause of mortality among women, outnumbering
all other diseases. Weight gain, a lack of exercise, hormonal changes and
medication, overuse of the oral contraceptive pill, stress, and most likely late work
schedules are a few of the factors that raise the risk of breast cancer [5]. Breast cancer
has a low survival rate since it is often detected late. Early identification and treatment,
according to the World Cancer Report 2020, are the most effective interventions for
breast cancer control [6]. Modern techniques and recent diagnostic advancements
are beneficial in addressing these issues since they are non-invasive and painless and
detect tumors in minimal time; as a result, breast cancer can be treated at an earlier
stage than with previous approaches.
Fig. 2 Screening methods
of breast cancer
2.1 Existing Approaches to Detect Breast Cancer

Early detection and treatment are the most important factors in a woman's breast
cancer survival. Breast Ultrasound (BU), mammography, Magnetic Resonance
Imaging (MRI), Computed Tomography (CT), thermography, biopsy, and
Microwave Breast Imaging are the breast cancer screening procedures that have been
developed. Many existing screening and developing technologies are utilized
to diagnose breast cancer in its early stages [7]. The current breast cancer screening
techniques are classified as shown in Fig. 2.
2.2 Popular Datasets Available for Research
This section delves into a detailed examination of public datasets available for the
classification of various breast cancer experiments. Breast Cancer Digital Repository
(BCDR—Film Mammography (FM), Full Field Digital Mammography (DM)),
Curated Breast Imaging Subset of the Digital Database for Screening Mammography
(CBIS-DDSM), Digital Database for Screening Mammography (DDSM), INBreast,
Breast Cancer Wisconsin (Original) Data Set, Bio-Imaging Challenge 2015 Breast
Histology (BICBH), Breast Cancer Histopathological Image (BreakHis), and
Mammographic Image Analysis Society (MIAS)/mini-MIAS are the eight public
datasets available for the classification of breast cancer, as shown in Table 1 [8].
Exclusive datasets have fewer annotated images than public datasets; as a result,
models tested on public datasets outperform models tested on private datasets.
Regardless of database type, grayscale images (mammograms, ultrasound, and MRI)
or colored images (histopathology) are used for breast cancer classification at the
abstract level. Furthermore, the majority of studies concentrated on binary
classification, with only a few focusing on multi-class problems for breast cancer
classification.
2.3 Deep Learning Techniques for Early Diagnosis of Breast
Cancer
Saad Awadh Alanazi et al. suggested a method to detect breast cancer in whole-slide
images automatically. A convolutional neural network (CNN) method is used
to improve detection by evaluating the hostile ductal carcinoma tissue zone. In
this study, three distinct CNN designs are discussed and their performances are
compared. The authors calculated four performance metrics, namely accuracy,
precision, recall, and F1 score. The employed CNN Model 3 achieved an accuracy
of 87%, potentially reducing human errors in the diagnosing process.
$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \tag{1}$$

$$\text{Precision} = \frac{TP}{TP + FP} \tag{2}$$

$$\text{Recall} = \frac{TP}{TP + FN} \tag{3}$$

$$\text{F1 Score} = \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} \tag{4}$$
Model 3's five-layer CNN is deeper than Models 1 and 2 and hence proved to be
the best among all. A large collection of roughly 275,000 RGB picture patches of
50 × 50 pixels was used to train all architectures. The use of a secondary database
like Kaggle is a major drawback of this study; for more accurate breast cancer
detection outcomes, future studies should be carried out on a primary dataset [8].

Anji Reddy Vaka et al. proposed a new approach for identifying breast cancer
called Deep Neural Network with Support Value (DNNS). Their proposed solution
Table 1 The list of publicly available datasets

1. Breast Cancer Digital Repository (BCDR)—Film Mammography (FM), Full Field Digital Mammography (DM): 1734/1010/724 patients; 7315/3703/3612 images; TIFF; benign and malignant
2. Curated Breast Imaging Subset of DDSM (CBIS-DDSM): 6775 patients; 10,239 images; DICOM; normal, benign, and malignant
3. Digital Database for Screening Mammography (DDSM): 2620 patients; 10,480 images; JPEG; normal, benign, and malignant
4. INBreast: 115 patients; 410 images; DICOM
5. Breast Cancer Wisconsin (Original) Data Set: 699 instances; 10 attributes; 458 benign and 241 malignant
6. Bioimaging Challenge 2015 Breast Histology (BICBH): 285 patients; 285 images; TIFF; normal, benign, in situ, and invasive
7. Breast Cancer Histopathological Database (BreakHis): 82 patients; 9109 images (40×: 1995, 100×: 2081, 200×: 2013, 400×: 1820); PNG; 2480 benign and 5429 malignant
8. Mammographic Image Analysis Society (MIAS): 322 images (50 micron resolution); PGM; normal, benign, and malignant
9. Mini-MIAS: 322 images (200 micron resolution); PGM; normal, benign, and malignant
is based on a deep neural network's support value. To improve the performance,
efficiency, and quality of the images, a normalization method is used. First,
preprocessing is done on the images to remove noise using a Gaussian filtering
technique. Further, Histo-Sigmoid Fuzzy Clustering is used to segment the tumor
from the extracted images, using the following histogram and sigmoid functions.
$$H_g = \sum_{i=1}^{x} H_{g_i} \tag{5}$$

$$\sigma(x) = \frac{1}{1 + e^{-x}} \tag{6}$$
The authors give pseudo-code for their proposed algorithm. Experiments
have shown that the suggested DNNS is far superior to existing approaches. The
proposed method proved favorable in terms of performance, efficiency, and
image quality, which is critical in today's medical systems [9].
$$\text{Support value-based normalized image (SN)} = \text{support value} \times \frac{X - X_{\min}}{X_{\max} - X_{\min}} \tag{7}$$
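Equation 7 is a support-value-weighted min–max normalization; the sketch below is our reading of the formula, with an assumed scalar support value:

```python
import numpy as np

def support_value_normalize(image: np.ndarray, support_value: float) -> np.ndarray:
    # Eq. 7: SN = support_value * (X - X_min) / (X_max - X_min),
    # i.e. min-max normalization scaled by the network's support value.
    x_min, x_max = image.min(), image.max()
    return support_value * (image - x_min) / (x_max - x_min)

# Illustrative 4x4 grayscale patch; the support value of 0.9 is an assumption.
patch = np.random.randint(0, 256, (4, 4)).astype(float)
print(support_value_normalize(patch, 0.9))
```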
In order to reduce the breast cancer-related mortality rate, Hassanien et al. looked
at a variety of recently developed models for breast cancer diagnosis and categorization.
They used the most common and largest database, CBIS-DDSM. The paper
infers that YOLO and RetinaNet are novel models that have recently been employed
and are believed to be simpler than conventional CNN networks in terms of mass
detection and malignancy classification; they achieve better outcomes and more
accurate performance [10].
Salvi proposed [11] a technique that could enable a patient to evaluate whether she
is at risk for breast cancer at an early stage, allowing the breast cancer cells to be
removed with adequate therapy. The paper also includes detailed information about
IoT technology combined with machine learning to assist patients living in distant
areas with limited access to doctors. According to their findings, when a patient
wants to know the condition of a cancer cell, she can use a thermal imaging sensor
to take pictures of the cancer cells. The image obtained is delivered as input to
the trained deep learning model via the Raspberry Pi microcontroller board. The
model uses a CNN to analyze whether the input image provided by the patient is
normal, benign, or malignant.
Abdelhafiz et al. [12] examined in great detail the performance, strengths, and limits
of the most recent CNN applications in evaluating mammography (MG) images.
This study examines contemporary CNN approaches to MG images, demonstrating
how developments in DL algorithms produce promising findings that can
assist radiologists and improve diagnosis time. CNN algorithms could be used to
process millions of regular imaging exams, flagging probable cancers to radiologists
who undertake follow-up operations. Other techniques, such as transfer learning, data
augmentation, batch normalization, and dropout, which are all tempting options for
reducing overfitting and increasing the generalization of the CNN model, are also
explained in this paper.
The goal of the Mashekova et al. study [13] was the detection of breast cancer
abnormalities using Infrared Breast Thermography (IBT), and the paper provided a
complete overview of the IBT method. The authors suggested that thermography
is one of the safest and least intrusive breast cancer screening modalities. They also
examined whether this non-contact, low-cost technique has significant potential for early
breast cancer detection through large-scale screening with ongoing monitoring of
questionable individuals. Thermography diagnoses breast cancer by identifying
certain aspects of breast heat trends over time. Through research, they found the
spectral radiance and wavelength relationship, which is given by Planck's radiation
law.
$$F = \frac{2hc^2}{\lambda^5}\left(e^{\frac{hc}{\lambda K T}} - 1\right)^{-1} \tag{8}$$
where F is the radiation power, h Planck's constant, c the speed of light in vacuum,
K the Boltzmann constant, λ the wavelength, and T the absolute temperature.
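Equation 8 can be evaluated numerically; a small sketch using standard physical constants, with an illustrative wavelength and skin temperature as assumptions:

```python
import math

H = 6.62607015e-34   # Planck's constant (J s)
C = 2.99792458e8     # speed of light in vacuum (m/s)
K = 1.380649e-23     # Boltzmann constant (J/K)

def spectral_radiance(wavelength_m: float, temperature_k: float) -> float:
    # Eq. 8: F = 2 h c^2 / lambda^5 * (exp(h c / (lambda K T)) - 1)^-1
    return (2 * H * C**2 / wavelength_m**5
            / (math.exp(H * C / (wavelength_m * K * temperature_k)) - 1))

# Skin at roughly 310 K emits strongly around 9-10 micrometres:
print(spectral_radiance(9.5e-6, 310.0))  # W per m^3 per steradian
```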
However, they also emphasized that, at this time, thermography can only be used
as a supplement to mammography because it is very sensitive to the procedure's
conditions and the patient's overall health. They highlighted an important point:
because standard cancer screening methods are unavailable in rural and distant places,
alternative technologies such as thermal imaging (thermography) have been developed.
In another study [18], the authors built a system that uses the extremely
powerful object-identification Faster Region-based CNN, a deep learning approach,
to automatically detect and classify breast cancer lesions in mammograms. A total
of 330 mammography images are used in their proposed CAD system, with 121
annotated images being used to train the Faster Region-based CNN network. On the
testing set, the suggested system had a mAP (mean Average Precision) of 0.857 and
performed well in detecting mammographic lesions.
$$\text{mAP} = \frac{1}{n}\sum_{i=1}^{n} AP_i \tag{9}$$

where n is the number of classes and $AP_i$ is the average precision of class i.
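Equation 9 is simply the mean of the per-class average precisions; a one-line sketch with illustrative AP values (assumptions, not the study's per-class results):

```python
def mean_average_precision(per_class_ap):
    # Eq. 9: mAP = (1/n) * sum of AP_i over the n classes.
    return sum(per_class_ap) / len(per_class_ap)

print(mean_average_precision([0.91, 0.83, 0.83]))  # ~0.857
```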
Traditional Faster Region-based CNN has the drawback of failing to recognize
multiple smaller objects. According to the paper, future work should include the
CBIS-DDSM database.
3 Challenges and Issues in Remote Monitoring System
The Internet of Things is steadily revolutionizing the healthcare industry. This is possible
through the use of smart sensors that collect information, actuators, various cloud
services such as storing, processing, and analyzing, and short- and long-range
communication protocols. The Internet of Things has the ability to enable remote
monitoring, and this functionality might be beneficial in providing remote monitoring
for patients with breast cancer [14]. Actual implementation in a hospital is still
a long way off, however; remote monitoring systems have several open concerns and
challenges that need to be addressed in the future. In this section, a few of these
challenges are mentioned.
1. Scalability—Scalability is a characteristic of a healthcare device: it describes how
the device responds to environmental conditions, and a device with high scalability is
recommended, so it is critical to build highly scalable devices. IoT in the healthcare
system comprises various health-parameter-measuring sensors, medical devices,
actuators, and cloud storage, all of which communicate and share data using the
Internet. There should be homogeneity across healthcare IoT system devices in
order to achieve high scalability [15]. A system with high scalability functions
smoothly and efficiently, utilizing all available resources; hence, the scalability of
the healthcare IoT system must be managed well.
2. Power Consumption—The main source of power for the majority of IoT devices
is a battery, and these battery-based devices have limitations: once installed in
the system, it is difficult to change a device's battery. Hence, high-capacity
battery-powered devices are required in the healthcare IoT system, and it
is vital to create IoT devices for healthcare that can produce their own power
[11]. Renewable energy can be one possible solution: a healthcare
IoT system integrated with renewable energy will definitely improve the life of
devices, and such systems may help alleviate the global energy issue to
some extent. Also, when no sensor readings need to be reported, these devices
save energy by turning on a power-saving mode [15]; furthermore, if there is
nothing critical to process, they run at a low CPU speed.
3. Data Privacy and Security—Data privacy and security are major issues in
IoT systems. With millions of devices connected to the Internet, there
is always a threat that sensitive data can be mishandled or misused by attackers.
It is a challenging task to provide secure exchange of data between
heterogeneous devices; in order to provide secure interaction, IoT systems must
be equipped with a robust authentication approach [12, 16].
4. Self-Configuration—Self-configuration is a characteristic feature of every IoT
system. This feature allows an IoT system to configure itself automatically, for
example by automatically upgrading software versions or operating systems. It
should also give the user the power to use manual configuration, which allows
users to adjust system parameters according to application demands and
circumstances [17].
5. Servicing and Maintenance—The involvement of heterogeneous medical devices
and sensors makes an IoT system's maintenance expensive. As a result, the IoT
system must include devices and sensors with low maintenance costs. It is genuinely
challenging to choose such devices and come up with a system that has low
maintenance, repair, and upgrade costs [17, 18].
4 Case Study: A Remote Breast Cancer Patient Monitoring
System
Breast cancer has a low survival rate since it is often detected late. Most breast
cancer patients consult a doctor for the first time when they are in stage 3, and almost
15% of patients visit a doctor when they are in stage 4. Breast cancer is a curable
disease with a better chance of survival if caught early. The most common reason
why women do not get treatment on time is that a breast tumor causes no pain [11, 19].
Ideally, women should get a clinical breast exam done by a doctor every three years,
and following that, an ultrasound scan should be included in regular check-ups.
Facilities like mammography, ultrasound, and check-ups by experts are rarely available
in rural areas. A remote breast cancer patient monitoring system will prove to be a boon
for remotely located women, who could then easily avail themselves of these facilities.
A remote breast cancer patient monitoring system can consist of three main entities:
the rural/remote area, a local primary health center, and a medical expert. The primary
health center (PHC) will be equipped with an IoT board such as a Raspberry Pi with a
thermal IR camera and IoT communication technologies. Women from rural areas will
approach the local PHC. If, in the initial physical examination, any symptoms such as
change in shape or size of the breast, skin dimpling, swelling, or redness are observed,
images from the IR camera will be taken. The PHC will send this data to the cloud
through the IoT board for further processing.

Based on the data uploaded to the cloud, a trained deep learning model will detect
whether the patient is normal or has any malignancy. A notification will go to a medical
expert, oncologist, or hospital if malignancy is found. The medical expert will go through
all the details and recommend a line of treatment. Along with the treatment procedure,
there would be a provision to inform patients about the nearest cancer care center.
A detailed overview of the proposed remote breast cancer patient monitoring system
will be part of our future research paper.
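A minimal sketch of the PHC edge-side step described above; the endpoint URL, field names, and file name are hypothetical, since the actual system design is deferred to future work:

```python
import time
import requests  # assumes the requests library is installed

CLOUD_ENDPOINT = "https://example-cloud/phc/upload"  # hypothetical URL

def capture_thermal_image() -> bytes:
    # Placeholder for reading a frame from the thermal IR camera
    # attached to the Raspberry Pi (camera driver not shown).
    with open("thermal_frame.png", "rb") as f:
        return f.read()

def upload_for_screening(patient_id: str) -> None:
    # Send the captured thermogram to the cloud, where the trained
    # deep learning model would classify it as normal or malignant.
    image = capture_thermal_image()
    response = requests.post(
        CLOUD_ENDPOINT,
        files={"image": ("thermal_frame.png", image, "image/png")},
        data={"patient_id": patient_id, "timestamp": int(time.time())},
        timeout=30,
    )
    response.raise_for_status()
    print("Screening result:", response.json())

upload_for_screening("PHC-0001")  # hypothetical patient identifier
```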
5 Result and Analysis of Existing Techniques
Early identification of breast cancer can be achieved using deep learning algorithms.
There has been research on the subject using different public and exclusive
datasets. In recent studies, convolutional neural networks (CNN), Faster R-CNN,
and DNNS have been used as DL techniques for breast cancer diagnosis. Table 2
contains a detailed analysis of these studies, including their objective, technique,
dataset type, performance measures, and future scope.
Table 2 Analysis of deep learning techniques to detect breast cancer

Alanazi et al. [8] — Objective: detection of BC in WSI by examining hostile ductal carcinoma tissue. Technique: convolutional neural network (CNN). Dataset: Kaggle 162 H&E. Performance: Acc = 87%. Future scope: more accuracy can be achieved by carrying out the study on primary data for breast cancer identification.

Vaka et al. [9] — Objective: detection of BC using machine learning techniques. Technique: new technique, deep neural network with support value (DNNS). Dataset: 8009 histopathology image samples from M. G. Cancer Hospital and Research. Performance: Acc = 97.21%, P = 97.9%, R = 97.0%.

Salvi and Kadam [11] — Objective: detection of breast cancer at an early stage using IoT. Technique: CNN. Dataset: name not specified, 100,000 images. Performance: Acc = 87.84%, AUC score = 94.6%. Future scope: public dataset.

Mohamed et al. [20] — Objective: detection of BC with the help of infrared technology. Techniques: CNN variants (ResNet18, GoogleNet, AlexNet, VGG16, and a proposed CNN). Dataset: DMR-IR, 1000 frontal thermogram images. Performance: Acc = 93.3%, 79.33%, 50.0%, 100%, 99.33%. Future scope: need to look at DL models that can use thermal pictures to highlight and name fault regions.

Gogoi et al. [17] — Objective: early abnormality detection in the breast using the Infrared Breast Thermography technique. Technique: support vector machine (SVM). Dataset: breast thermogram images from 60 females. Performance: Acc = 83.22%. Future scope: inclusion of a dataset of asymptomatic patients; expansion of the dataset.

Rajasekaran Subramanian et al. [18] — Objective: to detect and classify BC lesions in mammograms automatically. Technique: Faster R-CNN. Dataset: MIAS, 330 mammography images. Performance: mAP (mean average precision) = 0.857. Future scope: include the CBIS-DDSM database.

Agarwal et al. [21] — Objective: detection of tumors in FFDM mammogram images. Technique: Faster R-CNN. Dataset: OPTIMAM Mammography Image Database (OMI-DB), ~80,000 FFDMs. Performance: true positive rate (TPR) = 0.93.
6 Conclusion
The concept of smart healthcare is examined in this review paper, which explains
how remote patient monitoring is becoming useful in every element of healthcare.
The general trend of shifting from conventional to smart healthcare is discussed,
and various breast cancer detection screening approaches and prominent public
databases are highlighted. The study also discusses major issues in remote monitoring
systems and presents a case study on a remote monitoring system for breast cancer
patients that provides an enhanced solution for women in rural areas.
Future study will include the development of a feasible system solution for remote
breast cancer patient monitoring that may be employed in primary health clinics.
References
1. Kumar Y, Gupta S, Singla R et al (2021) A systematic review of artificial intelligence techniques
in cancer prediction and diagnosis. Archi Comput Methods Eng State Art Rev 1–28
2. Iranpak S, Shahbahrami A, Shakeri H (2021) Remote patient monitoring and classifying using
the Internet of Things platform combined with cloud computing. J Big Data 8(Article number
120) 1–22
3. Baker SB, Xiang W, Atkinson I (2017) IoT for smart healthcare: technologies, challenges and
opportunities. IEEE Access 5:26521–26544
4. Liaqat M, Iqbal MJ, Javed Z et al (2021) Clinical applications of artificial intelligence
and machine learning in cancer diagnosis: looking into the future. Cancer Cell Int 21(Article
number 270)
5. Malasinhe LP, Ramzan N, Dahal K (2019) Remote patient monitoring: a comprehensive study.
J Ambient Intell Humaniz Comput 10:57–76
6. El-Rashidy N, El-Sappagh S, Riazul Islam SM (2021) Mobile health in remote patient monitoring for chronic diseases: principles, trends, and challenges. Diagnostics 11(4)
7. Qadri YA, Nauman A, Zikria YB et al (2020) The future of healthcare IoT: a survey of emerging
technologies. IEEE Comm Surveys Tutori 22(2):1121–1167
8. Alanazi SA, Kamruzzaman MM, Nazirul Md et al (2021) Boosting breast cancer detection
using convolutional neural network. J Healthcare Eng 2021(Article ID 5528622), 11 p
9. Vakaa AR, Sonia B, Sudheer Reddy K (2020) Breast cancer detection by leveraging machine
learning. Science Direct ICT Express 6(4):320–324. ISSN 2405-9595
10. Taleb H, Nasser A, Andrieux G et al (2021) Wireless technologies, medical applications and
future challenges in WBAN: a survey. Wirel Netw 27:5271–5295
11. Salvi S, Kadam A (2021) Breast cancer detection using deep learning and IoT technologies.
Journal of Physics: Conference Series 1831 012030. In: International conference on robotics
and artificial intelligence (RoAI)
12. Abdelhafiz D, Yang C, Ammar R, Nabavi S (2019) Deep convolutional neural networks for
mammography: advances, challenges and applications. Bio Inform 20(Article number 281)
13. Mashekova A, Zhao Y, Ng EYK et al (2022) Early detection of the breast cancer using infrared
technology—A comprehensive review. In: Thermal science and engineering progress, vol 27.
Elsevier
14. Mohamed EA, Rashed EA, Gaber T, Karam O (2022) Deep learning model for fully automated
breast cancer detection system from thermograms. PloS Public Library Sci One 17(1):e0262349
15. Rehman O, Farrukh Z, Al-Busaidi AM et al (2020) IoT powered cancer observation system.
SMA 2020, pp 313–318
16. Halim A, Andrew A, Melvin A et al (2021) Existing and emerging breast cancer detection technologies and its challenges: a review. Appl Sci 11(22):10753
17. Gogoi UR, Majumdar G, Bhowmik MK (2019) Evaluating the efficiency of infrared breast thermography for early breast cancer risk prediction in asymptomatic population. Infrared Phys Technol 99:201–211
18. Rajasekaran Subramanian D, Rubi D, Lakshmi RG et al (2020) Breast cancer lesion detection
and classification in radiology images using deep learning. Eur J Mol Clin Med 07(3)
19. Hamim M, Paul S, Hoque SI et al (2019) IoT based remote health monitoring system for patients
and elderly people. In: International conference on robotics, electrical and signal processing
techniques (ICREST). INSPEC Accession Number 18473342
20. Pradhan B, Bhattacharyya S, Pal K (2021) IoT-based applications in healthcare devices. J
Healthcare Eng 2021 (article ID 6632599) 18 p
21. Agarwal R, Díaz O, Yap MH et al (2020) Deep learning for mass detection in full field digital
mammograms. Comput Biol Med 121:103774. ISSN 0010-4825
Simplifying the Code Editor Using
MEAN Stack Technologies
S. NagaMallik Raj, M. Jyothsna, P. Srinu, S. Karthik, K. Gnana Jeevana,
N. Thirupathi Rao, and Debnath Bhattacharyya
Abstract An online coding platform resides on a remote server and can be accessed through browsers. Because online code editors are efficient and fast, they are popular tools among developers. Competitive coding is an essential skill that every graduate should possess. Though various coding platforms exist on the Internet, having a coding platform of our own brings and develops a competitive environment in our college. This platform enables students to practice coding questions in multiple programming languages. They can compete with their fellow students, which sustains the competitive environment. Senior and experienced students can become problem setters, and every individual will be given a chance to contribute questions, blogs, or articles. The end application is a web application that can be accessed by every user upon authentication. The platform analyzes and prepares reports about user performance, showing where users are strong and where they must improve, based on previously solved questions.
Keywords Code editor · MEAN stack · Node.js
1 Introduction
Nowadays, information and communication technology (ICT) plays an important
role in enhancing quality and support in the engineering-pedagogical system. By
constructing computer laboratories with Internet access, the government of India has
S. NagaMallik Raj (B) · N. T. Rao
Department of Computer Science & Engineering, Vignan’s Institute of Information Technology
(A), Visakhapatnam, AP, India
e-mail: mallikblue@gmail.com
M. Jyothsna · P. Srinu · S. Karthik · K. G. Jeevana
Department of CSE, Vignan’s Institute of Information Technology (A), Duvvada, Visakhapatnam,
India
D. Bhattacharyya
Department of Computer Science & Engineering, Koneru Lakshmaiah Education, Vaddeswaram,
Guntur, Andhra Pradesh 522502, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_13
taken the lead in introducing ICT at many education levels. In the Indian education system, this strategy allows every student to study and enhance his knowledge without restrictions, while ICT introduces many new teaching–learning processes. An online coding platform has been regarded as a creative teaching technique that helps students tackle coding difficulties virtually from anywhere and at any moment, offering flexibility, portability, low cost, and user-friendliness. Despite challenges such as network issues, online coding platforms have been regarded as useful tools for acknowledging and enhancing students' programming abilities. The faculty's workload and time will be reduced because of this method [1–4].
There are many coding platforms on the Internet, but none of them provides college-specific leaderboards, and no platform creates a competitive environment at the college level. We could not find any platform where a student or user can contribute coding questions; only admins can.
MEAN is an acronym that stands for MongoDB, Express.js, AngularJS, and Node.js. MEAN is a complete JavaScript stack that is mostly used for cloud-ready apps. Understanding why you may use it, identifying examples of when you could use it, and delving further into the different components can all help you optimize MEAN's usefulness for software development [5–8].
1.1 Why MEAN?
The following aspects made us choose the MEAN stack:
Lower development costs
High performance
Open-source stack
User-friendly
Community support
Infinite modules
Flexibility and scalability.
Some major applications reported to be built with MEAN stack components are Facebook, Instagram, YouTube, Forbes, and Paytm.
On global platforms, students cannot upload questions; they can only answer the questions already available. If a student is interested in contributing questions, existing platforms do not make that possible. So, this project aims at helping students contribute their own questions (Fig. 1).
Fig. 1 Working model of
MEAN stack
2 Related and Proposed Work
2.1 Existing Systems
Based on the review of the existing systems, the following observations were made.
Hacker Rank
Hacker Rank is a well-known coding platform that allows programmers across the world to solve coding problems and challenges. It supports a wide range of programming languages and spans a wide range of computer science areas. When a programmer submits a solution to a programming challenge, the correctness of the output is used to score the submission [4, 8–10].
Code Chef
It is a well-known global competitive programming platform that supports over 50 programming languages and has a significant programming community that helps students and other computer professionals test and improve their coding abilities. Its goal is to create a forum for students and professional software engineers to practice, compete, and grow. CodeChef holds frequent practice contests for the ACM-ICPC as well as monthly contests with rewards [11, 12].
In the existing platforms, only the campus ambassadors and platform admins of
those platforms can contribute a question, but others do not have that facility.
2.2 Proposed System
Our project focuses on helping students in our college. Till now, we have not had any college-specific coding platform, so, keeping that in mind, we thought of providing support to the students of our college. As mentioned before, in the existing systems students do not have a chance to post a question. Solving a problem and creating a problem are equally important [13–15]. Problem designing improves a
Fig. 2 An overview of proposed system
student's thinking capability and skill. So, our project facilitates a feature where users can contribute questions. To do that, admin validation is necessary: once the admin approves a question contributed by a student, that question is added to the question list visible on the home page. Coming to the code editor, it supports multiple programming languages such as C, C++, and Python, and a variety of themes is provided. It is not just a code editor; test cases are also given with the question itself. Once users submit their code, they can check the results and rework it if needed. The other important feature of our project is blog creation: users can also write blogs on this platform. The blog's creator and creation time are displayed below the blog title, and users can see their own blogs by going to their profile and clicking 'My Blogs' [16].
This is a college-specific platform with some new features which the existing
system lacks (Fig. 2).
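As a rough illustration of the submission flow described above, the sketch below (Python; a minimal sketch, not the platform's actual implementation, with the command, file names, and verdict labels as assumptions) runs a submitted program against stored test cases and scores it by output correctness:

```python
import subprocess

def judge(command, test_cases, time_limit=2):
    """Run a submitted program against (input, expected_output) pairs and
    return one verdict per test case."""
    verdicts = []
    for stdin_data, expected in test_cases:
        try:
            result = subprocess.run(
                command, input=stdin_data, capture_output=True,
                text=True, timeout=time_limit,
            )
        except subprocess.TimeoutExpired:
            verdicts.append("TLE")   # time limit exceeded
            continue
        if result.returncode != 0:
            verdicts.append("RE")    # runtime error
        elif result.stdout.strip() == expected.strip():
            verdicts.append("AC")    # accepted
        else:
            verdicts.append("WA")    # wrong answer
    return verdicts

# Hypothetical usage for an "add two numbers" question with two test cases.
print(judge(["python3", "solution.py"], [("1 2\n", "3"), ("5 7\n", "12")]))
```

In the real platform, this logic would run on the server inside a sandbox, with per-language commands and resource limits.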
3 Results and Sample Output Screens
Below are sample screenshots of the proposed system. Firstly, the user has to register on the platform using the registration page shown in Fig. 3. After registration, the user is directed to a login page, which requires inputs such as an ID number and password, as shown in Fig. 4. The user is then taken to the home page, which lists a set of questions along with their difficulty levels, as shown in Fig. 5. Figure 6 shows what the code editor looks like: the page contains the question description on the left side and an editor on the right side. Figure 7 shows how the results are displayed after submitting a solution, along with the time and memory taken for each input.
Fig. 3 User registration
page
Fig. 4 Login page for the
user
Fig. 5 Home page of
proposed model
Fig. 6 Sample screen for
code editor
Fig. 7 Proposed model with
questionnaire part and editor
part
In our proposed model, we have added additional features like blogs and a question
contribution page which are shown in Figs. 8 and 9. Figure 10 depicts the blog page,
i.e., whatever blogs are created by users will be displayed here.
Fig. 8 Sample screen for result area
Fig. 9 Question contribution page
Fig. 10 Blogs that users
have contributed will be
displayed here
4 Conclusion
We conclude that this project will be a beneficial platform for students. We identified the features that are missing in the existing systems and provided our system with them. To prepare for campus placements, especially to gain or brush up on coding skills, students do not need to look to other platforms. As this is a college-specific platform, students can make use of this opportunity, and every student can check his or her ranking in the college. This project will foster a competitive spirit in students, which is a vital task. Creating a question is not an easy task, because it needs a problem setter to think of every constraint, and test cases must be formed accordingly. The problem setter can be any student; it does not matter whether the person is a senior or a junior. Students cannot publish a question directly: once a question is prepared, it must wait for the admin's approval.
References
1. Satyanarayana KV, Rao NT, Bhattacharyya D, Hu Y (2022) Identifying the presence of
bacteria on digital images by using asymmetric distribution with k-means clustering algorithm.
Multidimension Syst Signal Process 33(2):301–326. https://doi.org/10.1007/s11045-021-008
00-0
2. Chandra Sekhar P, Thirupathi Rao N, Bhattacharyya D, Kim T (2021) Segmentation of natural
images with k-means and hierarchical algorithm based on mixture of Pearson distributions. J
Sci Ind Res 80(8):707–715. Retrieved from www.scopus.com
3. Bhattacharyya D, Dinesh Reddy B, Kumari NMJ, Rao NT (2021) Comprehensive analysis on
comparison of machine learning and deep learning applications on cardiac arrest. J Med Pharm
Allied Sci 10(4):3125–3131. https://doi.org/10.22270/jmpas.V10I4.1395
4. Bhattacharyya D, Doppala BP, Thirupathi Rao N (2020) Prediction and forecasting of persistent
kidney problems using machine learning algorithms. Int J Curr Res Rev 12(20):134–139.
https://doi.org/10.31782/IJCRR.2020.122031
5. Mandhala VN, Bhattacharyya D, Vamsi B, Thirupathi Rao N (2020) Object detection using
machine learning for visually impaired people. Int J Curr Res Rev 12(20):157–167. https://doi.
org/10.31782/IJCRR.2020.122032
6. Bhattacharyya D, Kumari NMJ, Joshua ESN, Rao NT (2020) Advanced empirical studies on
group governance of the novel corona virus, mers, sars and ebola: a systematic study. Int J Curr
Res Rev 12(18):35–41. https://doi.org/10.31782/IJCRR.2020.121828
7. Asish Vardhan K, Thirupathi Rao N, Naga Mallik Raj S, Sudeepthi G, Divya, Bhattacharyya
D, Kim T (2019) Health advisory system using IoT technology. Int J Recent Technol Eng
7(6):183–187. Retrieved from www.scopus.com
8. Eali SNJ, Bhattacharyya D, Nakka TR, Hong S (2022) A novel approach in bio-medical image
segmentation for analyzing brain cancer images with U-NET semantic segmentation and TPLD
models using SVM. Traitement Du Signal 39(2):419–430. https://doi.org/10.18280/ts.390203
9. Doppala BP, NagaMallik Raj S, Stephen Neal Joshua E, Thirupathi Rao N (2021) Automatic
determination of harassment in social network using machine learning. https://doi.org/10.1007/
978-981-16-1773-7_20. Retrieved from www.scopus.com
10. Eali SNJ, Rao NT, Swathi K, Satyanarayana KV, Bhattacharyya D, Kim T (2018) Simulated
studies on the performance of intelligent transportation system using vehicular networks. Int J
Grid Distrib Comput 11(4):27–36. https://doi.org/10.14257/ijgdc.2018.11.4.03
11. Joshua ESN, Battacharyya D, Doppala BP, Chakkravarthy M (2022) Extensive statistical anal-
ysis on novel coronavirus: towards worldwide health using apache spark. https://doi.org/10.
1007/978-3-030-72752-9_8. Retrieved from www.scopus.com
12. Joshua ESN, Bhattacharyya D, Chakkravarthy M (2021) Lung nodule semantic segmentation
with bi-direction features using U-INET. J Med Pharm Allied Sci 10(5):3494–3499. https://
doi.org/10.22270/jmpas.V10I5.1454
13. Joshua ESN, Bhattacharyya D, Chakkravarthy M, Kim H (2021) Lung cancer classification
using squeeze and excitation convolutional neural networks with grad cam++ class activation
function. Traitement Du Signal 38(4):1103–1112. https://doi.org/10.18280/ts.380421
14. Joshua ESN, Chakkravarthy M, Bhattacharyya D (2021) Lung cancer detection using impro-
vised grad-cam++ with 3D CNN class activation. https://doi.org/10.1007/978-981-16-177
3-7_5 Retrieved from www.scopus.com
15. Neal Joshua ES, Bhattacharyya D, Chakkravarthy M, Byun Y (2021) 3D CNN with visual
insights for early detection of lung cancer using gradient-weighted class activation. J Healthc
Eng. https://doi.org/10.1155/2021/6695518
16. Neal Joshua ES, Chakkravarthy M, Bhattacharyya D (2020) An extensive review on lung cancer
detection using machine learning techniques: a systematic study. Rev d’Intelligence Artificielle
34(3):351–359. https://doi.org/10.18280/ria.340314
Prediction and Identification of Diseases
to the Crops Using Machine Learning
S. NagaMallik Raj, Pyla Lohit, Doddala Jyotheendra,
Kannuru Chandana, P. Nikhil, N. Thirupathi Rao,
and Debnath Bhattacharyya
Abstract Farming is one of the major sectors that influence a country's economic growth. In countries like India, the majority of the population depends on agriculture for their livelihood, but in recent times agriculture in India has been enduring a structural change leading to a disastrous situation. The main purpose of this project is to build a website that assists people who want to grow crops at home or are interested in terrace farming, and that helps farmers maximize their yield and sell the harvested crop online by themselves, without any middleman, so that the farmer enjoys the maximum possible profit. Our website reduces the time and effort of users by providing various applications: crop recommendation, which works by analyzing attributes such as location, amount of rainfall in the region, and soil pH values; fertilizer recommendation, which recommends the necessary organic measures based on the type of crop and soil NPK values; and crop disease prediction, which predicts the disease of a particular crop from an uploaded image of the crop and suggests the organic treatment for that particular crop accordingly. Farmers can choose the type of treatment for their crops.
Keywords Farmers · Middlemen · Agriculture · Rainfall · Crop recommendation · Fertilizer recommendation · Terrace farming · Crop disease prediction · Soil pH values
S. NagaMallik Raj (B) · N. T. Rao
Department of Computer Science & Engineering, Vignan’s Institute of Information Technology
(A), Visakhapatnam, AP, India
e-mail: mallikblue@gmail.com
P. Lohit · D. Jyotheendra · K. Chandana · P. Nikhil
Department of CSE, Vignan’s Institute of Information Technology (A), Duvvada, Visakhapatnam,
India
D. Bhattacharyya
Department of Computer Science & Engineering, Koneru Lakshmaiah Education, Vaddeswaram,
Guntur, Andhra Pradesh 522502, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_14
1 Introduction
Agriculture and farming are among the main sectors that influence a country's economy. Farming (irrespective of scale) requires continually learning new information about farming techniques, so that farmers get proper yield and profit from what they have cultivated. A friendly agricultural guidance and knowledge-gathering service can be very resourceful and helpful for farmers. Crop cultivation and selling depend upon several factors, including the climate, the average rainfall in the area, and several other factors. This application helps farmers sell their products online, leading them to achieve success and increase their standard of living. Agriculture has always had an intensive record in India: India is ranked second in farm output globally [1–5]. The percentage share of GVA of agriculture and allied sectors in India's total economy stood at 20% in 2020–2021 despite the Covid-19 pandemic [6–8]. Crop yield relies upon multiple factors, including soil, climatic, geographic, organic, and economic elements. These critical factors should be considered by farmers when selecting a crop, since they play a major role in crop yield. It is hard for farmers to determine when and which crops to plant due to fluctuating market prices. According to Wikipedia figures, India's farmer suicide rate has ranged from 1.4 to 1.8 per 100,000 population during the last 10 years [4, 8, 9]. It is tough for farmers to determine when and which crops to plant, and what the right time and place to start is, due to uncertainty in climatic conditions. The choice of fertilizers is likewise uncertain because of changes in seasonal climatic conditions and in fundamental resources such as soil, water, and air. In this situation, the crop yield rate is steadily declining. The solution to this problem is to provide a smart, user-friendly recommendation system to farmers. In addition, many diseases are destroying the plants: pathogens such as fungi and bacteria live on the plant and take their energy from it, and with excessive use of pesticides and insecticides the plants suffer further. These diseases are responsible for a great deal of damage to farmers. By identifying the diseases and the affected parts of the plant, the crops can be saved at an early stage. This project aims to improve the benefits and profits of the farmers.
2 Literature Review
Farming has been the primary occupation in India and has become the major livelihood for farmers. The farmers in India are not getting proper benefits, as they sell their crops to dealers in the nearest market. The main agenda of this project is to maximize the profits enjoyed by the farmers. In this project, the farmers get an interface where they can advertise their produce, get the current market rates, and get in touch with clients directly through this website. They can sell their products directly to the clients. Users can access this website with an active Internet connection. The user can access recommendation systems from this
website. The applications provided on this website guide the farmers: the user can find out which crop to grow and which fertilizers to use. With the help of machine learning, crop and fertilizer recommendation can be done; the user accesses it by giving the soil values, location, and crop as input. The project also provides disease prediction, where the user uploads the image of a diseased plant.
3 Motivation
There exists a social responsibility to make sure that farmers get the maximum profit for all their hard work, without any interference from middlemen. Our nation's past, present, and future cannot be understood without farmers
because the roots of our country are deeply connected with agriculture. Our website
serves as a one-stop solution for farmers and homestead farmers by recommending
the best crops and fertilizers based on soil test results, and by simply uploading an
image, it identifies the disease that affected the plant. Additionally, the farmer can
sell his harvested crop by himself via a simple interface that connects him directly to
the public. This educates and motivates the people toward organic farming leading
to a healthier lifestyle.
4 Proposed System
The proposed system can be accessed through a mobile or a desktop with an Internet connection. This system is dedicated to farmers and to people who want to farm. The website allows users to better understand and gather the necessary information regarding the crops they want to cultivate and how to treat them. The proposed system lays out a plan to sell the cultivated crops in bulk without any interference from middlemen, which maximizes the profits of the user. Figure 1 represents the home page of the website, Fig. 2 represents the disease prediction page, and Fig. 3 represents the services view page.
5 Flowchart
See Fig. 4.
Fig. 1 Implementation of
the proposed model showing
the home page
Fig. 2 Implementation of
the proposed model showing
the modules
Fig. 3 Implementation of
the proposed model showing
abstract view
6 Algorithms, Dataset Used, and Accuracy Comparison
The crop recommendation model was developed using the random forest algorithm after testing the accuracy of models built with different algorithms: the decision tree algorithm, the random forest algorithm, the SVM classifier, and the Naive Bayes algorithm. Figure 5 presents a comparison of the accuracy of these algorithms for the crop recommendation model.
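A minimal sketch of such an accuracy comparison (assuming scikit-learn; the CSV file name is hypothetical, while the feature columns follow the soil and weather attributes described in this section) could look like this:

```python
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Hypothetical CSV holding the features named in the text plus a crop label.
df = pd.read_csv("crop_recommendation.csv")
X = df[["N", "P", "K", "temperature", "humidity", "ph", "rainfall"]]
y = df["label"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

models = {
    "Decision tree": DecisionTreeClassifier(random_state=42),
    "Random forest": RandomForestClassifier(n_estimators=100, random_state=42),
    "SVM classifier": SVC(),
    "Naive Bayes": GaussianNB(),
}
for name, model in models.items():
    model.fit(X_train, y_train)                      # train on 80% of the data
    acc = accuracy_score(y_test, model.predict(X_test))
    print(f"{name}: {acc:.3f}")                      # compare held-out accuracy
```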
Fig. 4 Flowchart of the
proposed system
Fig. 5 Accuracy
comparison between
different algorithms
Precision agriculture is in trend nowadays; it helps farmers make informed decisions about their farming strategy. Hence, we used a dataset to build a predictive model that recommends the most suitable crops to grow on a particular farm based on various parameters. The data used in this project was made by augmenting and combining various publicly available Indian datasets on weather, soil, and so on. This data is relatively simple, with few but useful features, unlike the complicated features affecting crop yield. The data has the nitrogen, phosphorus, potassium, and pH values of the soil, and it also contains the humidity, temperature, and rainfall required for a particular crop.
The data used in this project for fertilizer suggestion is custom built, with appropriate NPK values corresponding to each crop; hence, when the measured values are entered in the application, it will predict what the soil lacks or has in excess and will recommend improvements.
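The fertilizer-suggestion logic can be sketched as a simple lookup and comparison (the ideal NPK values and the tolerance below are illustrative placeholders, not the project's custom-built table):

```python
# Hypothetical ideal NPK values per crop; the project uses a custom-built table.
IDEAL_NPK = {
    "rice":  {"N": 80, "P": 40, "K": 40},
    "maize": {"N": 80, "P": 40, "K": 20},
}

def fertilizer_advice(crop, measured, tolerance=10):
    """Compare measured soil NPK values with the crop's ideal values and
    report which nutrients are deficient, in excess, or adequate."""
    advice = {}
    for nutrient, ideal in IDEAL_NPK[crop].items():
        diff = measured[nutrient] - ideal
        if diff < -tolerance:
            advice[nutrient] = "deficient"
        elif diff > tolerance:
            advice[nutrient] = "excess"
        else:
            advice[nutrient] = "adequate"
    return advice

print(fertilizer_advice("rice", {"N": 50, "P": 60, "K": 42}))
# {'N': 'deficient', 'P': 'excess', 'K': 'adequate'}
```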
The plant disease prediction is done by classification with ResNet-9, which resulted in 99.2% accuracy on the dataset used; the dataset was created using offline augmentation from the original dataset. The original PlantVillage dataset can be found here [10–13]. This dataset consists of about 87 K RGB images of healthy and diseased crop leaves, categorized into 38 different classes. The total dataset is divided in an 80/20 ratio into training and validation sets, preserving the directory structure. A new directory containing 33 test images was created later for prediction purposes.
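A minimal sketch of loading such a directory-structured dataset for training follows (assuming PyTorch/torchvision; the directory names are hypothetical, while the class count matches the text):

```python
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

# Hypothetical layout: one sub-folder per class under train/ and valid/,
# matching the 80/20 split that preserves the directory structure.
tfm = transforms.Compose([transforms.Resize((256, 256)), transforms.ToTensor()])
train_ds = datasets.ImageFolder("plant_village/train", transform=tfm)
valid_ds = datasets.ImageFolder("plant_village/valid", transform=tfm)

train_dl = DataLoader(train_ds, batch_size=32, shuffle=True)
valid_dl = DataLoader(valid_ds, batch_size=32)

print(len(train_ds.classes))  # expected: 38 healthy/diseased leaf classes
```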
7 Conclusion
This work will help farmers, and anyone willing to farm, to get more information about their crops and the issues affecting their yield right from their homes. The prediction of crop yield is primarily based on soil records and a few other factors that affect crop yield. From the above work, we conclude that the following functions can be achieved by building an Internet website for farmers. It helps farmers by letting them upload a photo of the crop: crop disease detection uses image processing, wherein users get insecticide recommendations based on disease photos. This efficient way of visualizing crop diseases saves cost by avoiding unnecessary usage of fertilizers, insecticides, and pesticides, and fertilizer prediction and crop prediction can be done based on soil conditions [14–16]. The project additionally says which fertilizers must be used, instead of depending on the farmer's earlier experience. The project has successfully carried out the idea of an online enterprise and provided an e-commerce application to sell products online, so that farmers benefit from the whole profit themselves. Buyers and dealers can get the information required to purchase the crop yield from anywhere, and customers can know who produced the crops they are buying. This website helps people with a lack of insight into farming. The system has been successfully developed with various applications, all in one place, for farmers.
References
1. Satyanarayana KV, Rao NT, Bhattacharyya D, Hu Y (2022) Identifying the presence of
bacteria on digital images by using asymmetric distribution with k-means clustering algorithm.
Multidimension Syst Signal Process 33(2):301–326. https://doi.org/10.1007/s11045-021-008
00-0
2. Chandra Sekhar P, Thirupathi Rao N, Bhattacharyya D, Kim T (2021) Segmentation of natural
images with k-means and hierarchical algorithm based on mixture of Pearson distributions. J
Sci Ind Res 80(8):707–715. Retrieved from www.scopus.com
3. Bhattacharyya D, Dinesh Reddy B, Kumari NMJ, Rao NT (2021) Comprehensive analysis on
comparison of machine learning and deep learning applications on cardiac arrest. J Med Pharm
Allied Sci 10(4):3125–3131. https://doi.org/10.22270/jmpas.V10I4.1395
4. Bhattacharyya D, Doppala BP, Thirupathi Rao N (2020) Prediction and forecasting of persistent
kidney problems using machine learning algorithms. Int J Curr Res Rev 12(20):134–139.
https://doi.org/10.31782/IJCRR.2020.122031
5. Mandhala VN, Bhattacharyya D, Vamsi B, Thirupathi Rao N (2020) Object detection using
machine learning for visually impaired people. Int J Curr Res Rev 12(20):157–167. https://doi.
org/10.31782/IJCRR.2020.122032
6. Bhattacharyya D, Kumari NMJ, Joshua ESN, Rao NT (2020) Advanced empirical studies on
group governance of the novel corona virus, mers, sars and ebola: a systematic study. Int J Curr
Res Rev 12(18):35–41. https://doi.org/10.31782/IJCRR.2020.121828
7. Asish Vardhan K, Thirupathi Rao N, Naga Mallik Raj S, Sudeepthi G, Divya, Bhattacharyya
D, Kim T (2019) Health advisory system using IoT technology. Int J Recent Technol Eng
7(6):183–187. Retrieved from www.scopus.com
8. Eali SNJ, Bhattacharyya D, Nakka TR, Hong S (2022) A novel approach in bio-medical image
segmentation for analyzing brain cancer images with U-NET semantic segmentation and TPLD
models using SVM. Traitement Du Signal 39(2):419–430. https://doi.org/10.18280/ts.390203
9. Doppala BP, NagaMallik Raj S, Stephen Neal Joshua E, Thirupathi Rao N (2021) Automatic
determination of harassment in social network using machine learning. https://doi.org/10.1007/
978-981-16-1773-7_20. Retrieved from www.scopus.com
10. Eali SNJ, Rao NT, Swathi K, Satyanarayana KV, Bhattacharyya D, Kim T (2018) Simulated
studies on the performance of intelligent transportation system using vehicular networks. Int J
Grid Distrib Comput 11(4):27–36. https://doi.org/10.14257/ijgdc.2018.11.4.03
11. Joshua ESN, Battacharyya D, Doppala BP, Chakkravarthy M (2022) Extensive statistical anal-
ysis on novel coronavirus: towards worldwide health using apache spark. https://doi.org/10.
1007/978-3-030-72752-9_8. Retrieved from www.scopus.com
12. Joshua ESN, Bhattacharyya D, Chakkravarthy M (2021) Lung nodule semantic segmentation
with bi-direction features using U-INET. J Med Pharm Allied Sci 10(5):3494–3499. https://
doi.org/10.22270/jmpas.V10I5.1454
13. Joshua ESN, Bhattacharyya D, Chakkravarthy M, Kim H (2021) Lung cancer classification
using squeeze and excitation convolutional neural networks with grad cam++ class activation
function. Traitement Du Signal 38(4):1103–1112. https://doi.org/10.18280/ts.380421
14. Joshua ESN, Chakkravarthy M, Bhattacharyya D (2021) Lung cancer detection using impro-
vised grad-cam++ with 3D CNN class activation. https://doi.org/10.1007/978-981-16-177
3-7_5. Retrieved from www.scopus.com
15. Neal Joshua ES, Bhattacharyya D, Chakkravarthy M, Byun Y (2021) 3D CNN with visual
insights for early detection of lung cancer using gradient-weighted class activation. J Healthc
Eng 2021. https://doi.org/10.1155/2021/6695518
16. Neal Joshua ES, Chakkravarthy M, Bhattacharyya D (2020) An extensive review on lung cancer
detection using machine learning techniques: a systematic study. Rev d’Intelligence Artificielle
34(3):351–359. https://doi.org/10.18280/ria.340314
Pulse-Based Smart Electricity Meter
Using Raspberry Pi and MEFN
Eswar Abisheak Tadiparthi, Majji Prasanna Kumari,
Basanaboyana Vamsi Sai, Kollana Bharat Kalyan, B. Dinesh Reddy,
N. Thirupathi Rao, and Debnath Bhattacharyya
Abstract As the computation power of microcomputers increases over time, we need to put them to proper use. Green energy generation and consumption are evident problems that need to be prioritized. Green energy is not yet produced at a scale we can rely on, but if we know how electricity is consumed, we might be able to work around that. To know this, we use a light-dependent resistor (LDR) to count pulses from the electricity meter and update them in a database using an Internet-connected Raspberry Pi that hits the server, so the end user can see the utilization in real time.
Keywords Smart electricity meter · Raspberry Pi · LDR · Flutter · MEFN
1 Introduction
As humans, we continue to innovate and push boundaries to rely on carbon-free energy generation and consumption [1]. One needs to understand how humans consume energy so one can start switching to carbon-free energy sources, as not all areas are hungry for energy. The electricity consumed by the user can be monitored in real time using a Raspberry Pi with an Internet connection; based on the local energy consumption tariffs, the server can calculate the cost on the day of the billing cycle, meanwhile pushing notifications to the user's smartphone and sending emails to keep them updated on their usage. There might be electrical leakage, which is dangerous and causes a higher bill amount for the user. It might be due to malfunctioning electronic devices [2], old devices that are not great at optimal consumption of electricity, or current flowing to ground through a bad connection. Most appliances that
E. A. Tadiparthi (B) · M. P. Kumari · B. V. Sai · K. B. Kalyan · B. Dinesh Reddy · N. T. Rao
Department of Computer Science and Engineering, Vignan’s Institute of Information Technology,
Visakhapatnam, Andhra-Pradesh, India
e-mail: teswar2001@gmail.com
D. Bhattacharyya
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
Vaddeswaram, Guntur, Andhra-Pradesh 522502, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_15
were designed and built recently are optimized to consume less energy to operate.
So, both the producer and consumer can be benefitted.
2 Literature
There are many papers published on the concept of smart electricity meter or auto-
matic meter reading. Most of the implementations use GSM, current, and voltage
sensing units. This paper focuses on using FCM (Firebase Cloud Messaging) to send
notifications and node mailer to send emails, and using the proposed method, a single
Raspberry Pi could read and sync meter readings from multiple meters simultane-
ously, and reading from pulses/blinks is more accurate than the reading from sensing
units or the LCD display [3].
Reference [4] Due to manual labor, the current electricity billing system has
major problems. This system will provide meter read and power disconnection when
power consumption exceeds the stated limit using IoT. The Arduino esp8266 micro-
controller is designed to perform objectives with the help of the GSM module. It is
proposed that all existing energy meter problems be overcome. All information is
sent to the consumer cell phone via IoT and GSM module and is also displayed on
the LCD. It saves time and helps to eliminate human interference using IoT.
3 Proposed Methodology
The system continuously monitors and updates the electricity usage, keeps the user in the loop, and sends timely alerts to caution the user, so one can take the necessary action to optimally consume non-renewable resources, which saves money and helps the environment. Electricity meters have a blinking/flashing LED, often with small text that reads, for example, 1000 Imp/kWh. The two important things here are that you have a blinking LED and you know the number of impulses that results in 1 unit, e.g., 800. The LDR senses the blinks [5] and the device makes a GET request to the server with the user ID; the server calculates the consumption from the number of blinks/pulses, which is converted into units (kWh) based on the meter specification. The data can be used for further analysis, as it is granular.
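A minimal sketch of the pulse-polling logic on the Raspberry Pi follows (assuming the standard RPi.GPIO and requests libraries; the GPIO pin, endpoint URL, and user ID are placeholders):

```python
import time

import requests                # assumed HTTP client for the GET request
import RPi.GPIO as GPIO        # standard Raspberry Pi GPIO library

LDR_PIN = 17                           # hypothetical pin wired to the LDR circuit
IMP_PER_KWH = 1000                     # printed on the meter, e.g., "1000 Imp/kWh"
SERVER = "https://example.com/pulse"   # hypothetical server endpoint
USER_ID = "user-42"                    # hypothetical user id

GPIO.setmode(GPIO.BCM)
GPIO.setup(LDR_PIN, GPIO.IN)

def on_pulse(channel):
    """Called on every LED blink; one blink equals 1/IMP_PER_KWH kWh."""
    requests.get(SERVER, params={"user": USER_ID, "ts": time.time()})

# Fire the callback on each rising edge; debounce to ignore electrical noise.
GPIO.add_event_detect(LDR_PIN, GPIO.RISING, callback=on_pulse, bouncetime=50)

try:
    while True:
        time.sleep(1)   # main thread idles; the edge callback does the work
finally:
    GPIO.cleanup()
```

On the server side, the consumed units are then simply the stored pulse count divided by IMP_PER_KWH.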
4 System Design
Three main connected systems help achieve this [6]: a poller at the meter with an Internet connection that polls every pulse; a server that stores the data and performs further analysis; and a user interface to view
and receive updates. The data stored can be used to do intensive analysis in almost
real time.
In Figs. 1 and 2, the server is written in NodeJS, where routing and middleware are handled using Express and JWT (jsonwebtoken) is used for authentication; it can be hosted on any cloud service provider such as GCP, AWS, or Heroku, as we do not need much computation. The data is stored as pulses in cloud-based MongoDB, a document-oriented NoSQL database. The database has three types of objects: FCM, pulse, and user. A single user can have multiple FCM tokens, as one can log in to multiple devices with the same account. A pulse is just a timestamp of when the blink happened, with a unique ID; we compute the units consumed from the number of pulses, so there is a field that maintains the pulse count. To connect the LDR to the RPi, we use GPIO, a standard interface used to connect microcontrollers to other electronic devices. In this case, we use those pins to connect multiple LDRs to a single Raspberry Pi.
The client app is built using Flutter and has the following screens: sign-in, sign-up, usage, profile, and pulse list. Flutter is an open-source front-end framework for creating native mobile applications and supports cross-platform development with a single codebase. Developing front-end applications in Flutter is fast and flexible, and it has a huge community of developers for support, as shown in Fig. 3.
Fig. 1 System design
Fig. 2 Connection diagram
5 Conclusion
From the hourly breakdown of power consumption for each day, one can understand which appliances consume the most energy and use them optimally to reduce consumption. The overall consumption data can be analyzed further to suggest areas that can shift to rely on carbon-free electricity
Fig. 3 Showing the client and server architecture of the app using Flutter
consumption without any problem. One can also find electricity leaks and malfunctioning electronic devices. Electricity is the most important resource in daily life, and it is important for everyone not to waste it, as shown in Fig. 4.
This method also limits the need for humans to repeatedly check usage at the meter. It is very cost effective over time: initially one has to invest in buying and setting it up, but later one only needs to maintain the devices and keep a few support members to fix any issues. The Raspberry Pi is very efficient in performance for its power consumption, and a single RPi can support multiple LDRs (the number varies with the model and available GPIO pins), so it is cheap to read from multiple meters.
Fig. 4 Annual cost versus benefits
References
1. Zheng G, Zhang Z (2011) Intelligent wireless electric power management and control system
based on ZigBee technology. In: International conference on transportation, mechanical, and
electrical engineering (TMEE) Changchun, China, 16–18 Dec 2011, pp 1120–1124
2. Maitra S (2008) Embedded energy meter—A new concept to measure the energy consumed by
a consumer and to pay the bill. In: Power system technology and IEEE Power India conference,
2008
3. Monteiro K, Rocha É, Silva É, Santos GL, Santos W, Endo PT (2018) Developing an e-health
system based on IoT Fog and cloud computing. In: 2018 IEEE/ACM International conference
on utility and cloud computing companion (UCC Companion), pp 17–18
4. Popa M (2011) Gateway design and implementation in an automatic meter reading system based
on power line communication. In: 7th International conference on networked computing and
advanced information management (NCM), pp 295–298
5. Mandhala VN, Bhattacharyya D, Vamsi B, Thirupathi Rao N (2020) Object detection using
machine learning for visually impaired people. Int J Curr Res Rev 12(20):157–167. https://doi.
org/10.31782/IJCRR.2020.122032
6. Patel K, Patel SM (2016) Internet of Things-IOT: definition characteristics architecture enabling
technologies application & future challenges. IJESC 6:6122–6131
Brain Tumor Segmentation Using U-Net
Paturi Jyothsna, Mamidi Sai Sri Venkata Spandhana, Rayi Jayasri,
Nirujogi Venkata Sai Sandeep, K. Swathi, N. Marline Joys Kumari,
N. Thirupathi Rao, and Debnath Bhattacharyya
Abstract Segmenting brain tumors from non-invasive magnetic resonance imaging (MRI) is a hard and vital task for several applications in medical science analysis. Nowadays, surgical operations are usually planned manually in hospitals, which takes excess time. Manually segmenting a brain tumor is a very lengthy job that depends strongly on the individual, and gliomas are the hardest tumors to find, having irregular shapes and vague boundaries. MRI images are the ones mostly used for segmentation of the affected portion of the brain. Segmentation of brain MRI images is one of the ways radiology distinguishes tumor tissue from normal tissue. In this paper, we present a proposed approach based on a fully convolutional network (FCN), using U-Net as the model. This model can serve as a vital aid in planned surgical operations, helping to accomplish successful operations on the human brain.
Keywords Brain tumor segmentation · Magnetic resonance image · Fully convolutional network · Deep learning · Machine learning
P. Jyothsna (B) · M. S. S. V. Spandhana · R. Jayasri · N. V. S. Sandeep · K. Swathi · N. T. Rao
Department of Computer Science & Engineering, Vignan’s Institute of Information Technology
(A), Visakhapatnam, AP, India
e-mail: paturijyothsna@gmail.com
N. Marline Joys Kumari
Department of Computer Science & Engineering, Anil Neerukonda Institute of Technology and
Sciences, Sanghivalasa, Visakhapatnam, AP 531162, India
D. Bhattacharyya
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
Vaddeswaram, Guntur, AP 522302, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_16
1 Introduction
Machine learning has been applied in many different areas, such as agriculture and medicine, for detection, classification, prediction, and segmentation. The benchmark in brain tumor prognosis is biopsy, which includes clinical examination using various anatomical techniques [1]. A biopsy is a kind of surgical procedure that results in bleeding and can even cause injury leading to loss of function. To get rid of this invasive method, we use magnetic resonance images to characterize brain tumor tissue against normal tissue by looking at the MRI images. The analysis of brain MRI images is a time-consuming task that must be performed by professional radiologists, and in a few cases even experienced radiologists cannot find the tumors. The main challenge in finding a tumor is its varying size, shape, boundaries [2], and location. In this paper, we mainly focus on the glioma tumor; as we know, gliomas are hard to find because of their ambiguity.
1.1 Brain Tumor Modalities
Non-invasive magnetic resonance imaging is usually used for the analysis of brain tumors. We have non-invasive methods such as magnetic resonance imaging (MRI) and also other, invasive methods that capture the brain tumor structure accurately, but using invasive systems other than MRI is exorbitant. The modalities are as follows: T1-weighted (T1) MRI, T1-weighted with contrast enhancement (T1c) MRI, T2-weighted (T2) MRI, and T2-weighted with fluid-attenuated inversion recovery (T2-Flair) MRI. The recognition of all four modalities is shown in Table 1, and a diagrammatic representation of the modalities is given in Fig. 1.
Table 1 Recognition of imaging modalities
Tissue T1 T1c T2 Flair
CSF Dark Dark Bright Dark
White matter Light Light gray Dark gray Dark gray
Cortex Gray Dark gray Light gray Light gray
Fat Bright Light Light Light
Inflammation Dark Light Bright Bright
Fig. 1 Diagrammatic representation of imaging modalities
2 Literature Review
Reference [3] used a 2D U-Net to segment each 3D MRI volume in slices. This method is faster to train and test and has lower computational requirements, but it is significantly overloaded with parameters (about 35 million) and does not use 3D contextual information. Reference [4] proposed the use of histogram equalization (HE) and Fuzzy Support Vector Machine (FSVM) classification algorithms to detect brain tumors. The suspicious parts of the images were segmented largely using the MRF segmentation method after the brain MRI was preprocessed with histogram equalization; the MRF method improved tumor segmentation accuracy, advancing the overall performance of the proposed method. For MRI brain imaging, Natarajan et al. [5] suggested a brain tumor detection approach in which the MRI brain pictures are first preprocessed using a median filter, then segmented using threshold segmentation and morphological procedures, and ultimately the tumor region is determined using image subtraction. This method accurately depicts the tumor shape in an MRI brain image. Based on Support Vector Machines, [3] proposed a hybrid technique for brain tumor detection in MRI images in which texture and intensity characteristics are applied. A technique for detecting and classifying brain tumors was proposed by [6]; this method uses segmentation to extract tumors, GLCM to extract features, and then BPNN and KNN classifiers to classify the MRI brain images as normal or abnormal. In this paper, we present a unique U-Net-based 3D fully convolutional [7] segmentation network. A complete data augmentation strategy was applied in this study to improve the segmentation accuracy.
3 Materials and Methods
The steps involved in this methodology are gathering the dataset, cleaning the data, data preprocessing, building the model, training the model, and evaluating the model.
Image dataset: The dataset used in this paper consists of training data and data required for validation. The dataset is BraTS 2020 [8] from MICCAI [9]. BraTS has focused on the evaluation of contemporary methods for intracranial tumor segmentation in complicated magnetic resonance image scans.
Data preprocessing: In the data preprocessing step, we perform noise removal, data cleaning, and data augmentation.
Cleaning the data: Cleaning the data is the most important step in any machine learning project, so we clean the data before feeding it into the machine learning model.
Data augmentation [10, 11]: We use data generators to create batches of images, which helps make training faster; because dealing with the large dataset is a tedious job, we generate the data in batches.
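A minimal sketch of such a batch generator (assuming Keras and volumes preprocessed into .npy files; the file layout is hypothetical):

```python
import numpy as np
from tensorflow.keras.utils import Sequence

class MRIBatchGenerator(Sequence):
    """Yields batches of preprocessed MRI inputs and masks on the fly,
    so the whole dataset never has to sit in memory at once."""

    def __init__(self, image_paths, mask_paths, batch_size=8):
        self.image_paths, self.mask_paths = image_paths, mask_paths
        self.batch_size = batch_size

    def __len__(self):
        # number of batches per epoch
        return int(np.ceil(len(self.image_paths) / self.batch_size))

    def __getitem__(self, idx):
        lo = idx * self.batch_size
        hi = lo + self.batch_size
        # assumes each case was saved as a .npy array during preprocessing
        X = np.stack([np.load(p) for p in self.image_paths[lo:hi]])
        y = np.stack([np.load(p) for p in self.mask_paths[lo:hi]])
        return X, y
```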
The methodology is as follows.
1. Building the model: We use the U-Net architecture, which is fully convolutional. We build the U-Net model by adding convolutional layers and max pooling layers, apply up-sampling and down-sampling, and make use of skip connections, which help recover lost information or features. This model is mainly used for biomedical images and performs segmentation of the affected region.
2. Training the model: We use the training data to train the model used in this proposed system. The model is trained with the different multimodal images: Flair, T1c, T1, and T2.
3. Splitting the data: We split the available data into training data and testing data, so that during model evaluation we can test the model against the testing data.
4. Evaluating the model: We finally evaluate the trained model using the testing dataset.
3.1 U-Net Architecture
The U-Net model is a variant of the convolutional neural network, extended with a few new features added to the CNN architecture. It was the very first segmentation model to deal with biomedical images. This model is used not only to detect whether an infection is present but also to localize the spot by masking the infected region. The architecture is called U-Net because it is U-shaped, and the complete architecture has 23 layers in total. The model consists of two sections: the contraction path and the expansion path. The contraction path lies on the left side of the architecture and is also called the encoder path; the expansion path is on the right side and is called the decoder path. The U-Net model is shown in Fig. 2.
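A minimal sketch of this encoder–decoder pattern with one skip connection follows (assuming Keras; the full 23-layer model repeats these blocks at several resolutions, and the layer sizes here are illustrative):

```python
from tensorflow.keras import Model, layers

def mini_unet(input_shape=(128, 128, 4), n_classes=4):
    """A reduced U-Net: one contraction step, a bottleneck, and one
    expansion step with a skip connection; the four input channels
    stand for the four MRI modalities."""
    inputs = layers.Input(input_shape)

    # Contraction (encoder) path
    c1 = layers.Conv2D(32, 3, activation="relu", padding="same")(inputs)
    c1 = layers.Conv2D(32, 3, activation="relu", padding="same")(c1)
    p1 = layers.MaxPooling2D(2)(c1)

    # Bottleneck
    b = layers.Conv2D(64, 3, activation="relu", padding="same")(p1)

    # Expansion (decoder) path; the skip connection recovers lost detail
    u1 = layers.Conv2DTranspose(32, 2, strides=2, padding="same")(b)
    u1 = layers.concatenate([u1, c1])   # skip connection from the encoder
    c2 = layers.Conv2D(32, 3, activation="relu", padding="same")(u1)

    outputs = layers.Conv2D(n_classes, 1, activation="softmax")(c2)
    return Model(inputs, outputs)

model = mini_unet()
model.summary()
```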
3.2 Fully Convolutional Network
A fully convolutional network uses convolutional neural networks to transform image pixels into pixel classes. This architecture is mainly used for semantic
Fig. 2 Architecture of U-Net
Fig. 3 Fully convolutional network example
segmentation. It deploys layers such as convolutional layers, max pooling layers, and up-sampling, but unlike a CNN it does not use dense layers, which makes training much faster. The FCN restores the feature maps to the size of the input image by applying transposed convolutional layers, which leads to pixel-level correspondence between the output and input images. A fully convolutional network is illustrated in Fig. 3.
4 Results and Discussions
After creating the U-Net model, we trained it using the training data and then tested it using the testing data. After processing the data, we evaluated the input with our model and predicted the output. The figure below shows the original Flair image and the ground truth obtained by applying the model.
In Fig. 4, the first picture is the original image and the other is the ground truth image of the tumor region. The red colored part is the full tumor; the tumor core and the enhancing tumor regions are represented using other colors.
The ground truth is mostly used in statistics and machine learning to check the accuracy of a machine learning model's results. Here, the ground truth refers to the information obtained during training.
Figure 5 is an example showing what one of the sub-regions looks like after segmentation, and Table 2 reports the Dice coefficient.
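The Dice coefficient measures the overlap between a predicted mask and the ground truth, Dice = 2|A ∩ B| / (|A| + |B|). A minimal NumPy sketch for one binary sub-region follows (for training, a smoothed differentiable variant is typically used instead):

```python
import numpy as np

def dice_coefficient(pred, truth):
    """Dice = 2|A ∩ B| / (|A| + |B|) for two binary masks."""
    pred = pred.astype(bool)
    truth = truth.astype(bool)
    intersection = np.logical_and(pred, truth).sum()
    total = pred.sum() + truth.sum()
    return 2.0 * intersection / total if total else 1.0

# Toy example: two overlapping 2x2 masks.
a = np.array([[1, 1], [0, 0]])
b = np.array([[1, 0], [0, 0]])
print(dice_coefficient(a, b))   # 2*1 / (2+1) = 0.667
```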
Figure 6 presents graphs comparing training and validation accuracy, loss, Dice coefficient, and mean IoU, demonstrating the performance of our model.
Fig. 4 Original image and the predicted mask image
Fig. 5 Example of edema region in gliomas
Table 2 Dice coefficient of three sub-regions of the glioma tumor
Sub-regions Dice coefficient
Full tumor 0.68
Tumor core 0.79
Enhancing tumor 0.75
Fig. 6 Comparison between training and validation of accuracy, loss, Dice coefficient, and the IOU
5 Conclusion
In this study, we present a more advanced version of the U-Net design for segmenting brain tumors. Based on experiments with a well-known benchmarking dataset (BraTS 2020), we have shown that, when compared to manually delineated ground truth, it can deliver efficient and reliable segmentation. Our deep convolutional networks, which are entirely based on U-Net, obtain comparable results for the whole tumor region and superior results for the core tumor region.
References
1. Satyanarayana KV, Rao NT, Bhattacharyya D, Hu Y (2022) Identifying the presence of
bacteria on digital images by using asymmetric distribution with k-means clustering algorithm.
Multidimension Syst Signal Process 33(2):301–326. https://doi.org/10.1007/s11045-021-008
00-0
2. Asish Vardhan K, Thirupathi Rao N, Naga Mallik Raj S, Sudeepthi G, Divya, Bhattacharyya
D, Kim T (2019) Health advisory system using IoT technology. Int J Recent Technol Eng
7(6):183–187. Retrieved from www.scopus.com
3. Eali SNJ, Bhattacharyya D, Nakka TR, Hong S (2022) A novel approach in bio-medical image
segmentation for analyzing brain cancer images with U-NET semantic segmentation and TPLD
models using SVM. Traitement Du Signal 39(2):419–430. https://doi.org/10.18280/ts.390203
4. Sarasaen C et al (2021) Fine-tuning deep learning model parameters for improved super-
resolution of dynamic MRI with prior-knowledge. Artif Intell Med 121:102196
5. Pallud J et al (2012) Quantitative morphological magnetic resonance imaging follow-up of
low-grade glioma: a plea for systematic measurement of growth rates. Neurosurgery 71:729–
740
6. Eali SNJ, Rao NT, Swathi K, Satyanarayana KV, Bhattacharyya D, Kim T (2018) Simulated
studies on the performance of intelligent transportation system using vehicular networks. Int J
Grid Distrib Comput 11(4):27–36. https://doi.org/10.14257/ijgdc.2018.11.4.03
7. Rajpurkar P et al (2018) Deep learning for chest radiograph diagnosis: a retrospective
comparison of the CheXNeXt algorithm to practicing radiologists. PLoS Med 15:e1002686
8. Ge C et al (2018) Deep learning and multi-sensor fusion for glioma classification using multi-
stream 2D convolutional networks. In: 2018 40th Annual international conference of the IEEE
engineering in medicine and biology society (EMBC), pp 5894–5897
9. Joshua ESN, Bhattacharyya D, Chakkravarthy M, Kim H (2021) Lung cancer classification
using squeeze and excitation convolutional neural networks with grad cam++ class activation
function. Traitement Du Signal 38(4):1103–1112. https://doi.org/10.18280/ts.380421
10. Pérez-García F et al (2021) Torchio: a python library for efficient loading, preprocessing,
augmentation and patch-based sampling of medical images in deep learning. Comput Methods
Programs Biomed 2021:106236
11. Chatterjee S, Nizamani FA, Nürnberger A et al (2022) Classification of brain tumours in MR
images using deep spatiospatial models. Sci Rep 12:1505. https://doi.org/10.1038/s41598-022-
05572-6
An Empirical Study of CNN-Deep
Learning Models for Detection
of Covid-19 Using Chest X-Ray Images
Mohd. Abdul Muqeet, Quazi Mateenuddin Hameeduddin,
B. Mohammed Ismail, Ali Baig Mohammad, Shaik Qadeer,
and M. Muzammil Parvez
Abstract Covid-19 turned into a pandemic and has affected routine life and global health. It is crucial to identify infectious Covid-19 subjects as early as possible to avert its spread. Chest X-ray (CXR) images processed with deep learning (DL) have recently become a serious method for early Covid-19 detection alongside the regular RT-PCR test. This paper examines deep learning models that detect Covid-19 from CXR images for early analysis. We conducted an empirical study to assess the efficacy of the proposed convolutional neural network DL model (CNN-DLM), pre-trained with some eminent networks such as MobileNet, InceptionNet-V3, ResNet50, Xception, and DenseNet121, for initial detection of Covid-19 on an openly accessible dataset. We also exhibit the accuracy and loss curves over the selected number of epochs for all these models. The results indicate that with the proposed CNN model pre-trained with DenseNet121, greater
Mohd. A. Muqeet (B)
Electrical Engineering Department, Muffakham Jah College of Engineering and Technology,
Hyderabad, India
e-mail: ab.muqeet2013@gmail.com
Q. M. Hameeduddin
Faculty of Electronics and Communication Engineering, Indian Naval Academy, Ezhimala,
Kerala, India
B. Mohammed Ismail
Department of Artificial Intelligence & Machine Learning, P.A. College of Engineering,
Mangalore, Karnataka, India
A. B. Mohammad
School of Electronics and Communication Engineering, REVA University, Bengaluru, India
S. Qadeer
Electrical Engineering Department, Muffakham Jah College of Engineering and Technology,
Hyderabad, India
M. Muzammil Parvez
Electronics and Communication Engineering Department, KLEF, Deemed to Be University,
Vaddeswaram, A.P, India
e-mail: parvez@kluniversity.in
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_17
results were achieved compared to other pre-trained CNN-DLMs applied in a transfer
learning approach.
Keywords Covid-19 · Chest X-ray (CXR) images · Convolutional neural network (CNN) · Deep learning models (DLM)
1 Introduction
Covid-19 spread to such an extent that it turned into a severe medical issue, and it is considered a highly transmissible disease [1]. RT-PCR trials are the standard investigations for Covid-19 [2]. But due to the manual examination process and the usage of a testing kit, RT-PCR tends to be a slow process, with an accuracy of not more than 64% [3]. Besides RT-PCR trials, other significant techniques that facilitate the detection of Covid-19 include chest radiography [4, 5], which assists in timely detection for immediate medical intervention. The difference between the CXR image of a normal subject and that of a Covid-19-infected subject is illustrated in Fig. 1.
Numerous researchers have focused their studies on applying artificial intelligence and deep learning techniques to medical image analysis [6, 7]. To discover Covid-19 in CXR images, the COVIDX-Net [8] model was suggested using CNNs. DarkCovidNet [9] was developed on CXR images for automatic Covid-19 diagnosis and realized a binary classification accuracy of 98.08%. Prevailing CNN models were applied for Covid-19 patient classification and attained an accuracy of 98.75% [10]. Using CXR images, Narin et al. [11] used the ResNet50 CNN-DLM to achieve 98% Covid-19 accuracy on 100 images (50 normal and 50 Covid-19). Muqeet et al. [12] demonstrated good performance using features retrieved from CXR images
after applying a few prominent DL models. COVIDiagnosis-Net using SqueezeNet
[13] was proposed along with a Bayesian optimizer, and an accuracy of 98.30%
was reported. Farooq and Hafeez [14] applied ResNet-based CNN-DLM termed
as COVIDResNet for Covid-19 classification and reported an accuracy of 96.23%.
Similarly, recent investigations [15, 16] also exhibit the advantage of DL methods for
the detection of the Covid-19. Section 2 explores the details of the applied database
and also discusses the steps for model formulation for Covid-19 detection process.
Section 3 describes the empirical studies conducted on different CNN-DLMs and
suggests the most suitable CNN-DLM for the detection task. Lastly, in Sect. 4, the
paper is concluded.
2 Materials and Methods
This section outlines the dataset selection and provides the details of the model
formulation and the CNN-DLM structure applied for the detection of Covid-19 from
CXR images.
2.1 Dataset
Several data sources with CXR images are accessible through repositories such as GitHub
and Kaggle. The dataset considered here is from the GitHub repository [17], which
includes CXR images of both normal and Covid-19 patients.
2.2 Model Formulation
With the help of transfer learning, a CNN pre-trained on ImageNet demonstrates a strong
ability to generalize to images outside the ImageNet dataset [18, 19]; we use the
Keras core library. Various pre-trained CNN-DLMs are available for image
classification. Among these, we considered MobileNet [20], Inception-V3 [21],
ResNet50 [22], Xception [23], and DenseNet121 [24] in a transfer learning approach
for the detection of Covid-19 patients. The CNN-DLM with a layered network is well
suited to capture the characteristics of the images. The following steps illustrate the
building procedure of our proposed CNN-DLM; the proposed model is applied in a
transfer learning approach to the selected pre-trained models.
(1) Initially, the CXR images of the normal and Covid-19 subjects are applied as
input to the CNN-DLM.
(2) Convolution filters and feature maps are applied to each of the individual images,
which generate a convolution layer.
(3) Next, MaxPooling and ReLU functions are applied at the end of each convolution
layer to accomplish the nonlinear transformation of the inputs present in the
model. The ReLU activation layer gives the CNN model further acceleration
for performing more complex tasks.
(4) The resulting feature maps are forwarded to the pooling layer to offer spatial
invariance to the CNN-DLM and to produce a pooled feature map.
(5) A Dropout layer with a 0.5 dropout rate is applied to reduce overfitting of the
model.
(6) Steps 2-5 are applied two more times.
(7) The final pooled feature map attained in the earlier step is flattened.
(8) Finally, a Dense layer with the Softmax activation function is applied to execute
the binary classification using the binary cross-entropy function (a sketch follows).
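As an illustration of Steps 1 to 8, the following minimal Keras sketch assembles the described layer sequence. The input size, filter counts, and kernel sizes are illustrative assumptions, since the chapter does not specify them; the two-unit Softmax output with binary cross-entropy follows Step 8.

```python
# A minimal sketch of the CNN-DLM building steps; hyper-parameter
# choices below are assumptions, not taken from the chapter.
from tensorflow.keras import layers, models

def build_cnn_dlm(input_shape=(224, 224, 3)):
    model = models.Sequential([layers.InputLayer(input_shape=input_shape)])
    # Steps 2-5, repeated three times (Step 6): convolution with ReLU,
    # max pooling for spatial invariance, dropout (rate 0.5) against
    # overfitting.
    for filters in (32, 64, 128):
        model.add(layers.Conv2D(filters, (3, 3), padding="same",
                                activation="relu"))
        model.add(layers.MaxPooling2D((2, 2)))
        model.add(layers.Dropout(0.5))
    model.add(layers.Flatten())                       # Step 7
    model.add(layers.Dense(2, activation="softmax"))  # Step 8
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model
```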
3 Experimental Results
The outcome of the empirical study on various CNN-DLMs for the selected dataset
is discussed here. We consider the dataset from [17], with 1800 images in total, to
evaluate the suggested technique. Of the CXR images in this dataset, 80% are used
for the training process and 20% for evaluation. Each CXR image is classified as
Covid-19 infected or normal using a DL-based classification algorithm on this
data. The work is carried out with the open-source Keras [19] framework and the
TensorFlow [20] backend, implemented using Google Colab.
3.1 Evaluation Metrics
A confusion matrix [16] summarizes the detection results of a classification process
and yields parameters such as recall, precision, F1-Score, and accuracy [25]. The
performance of the developed model is evaluated based on these estimated values.
True positives (tp) are those where actual and predicted results are positive. False
negatives ( fn) are those where actual results are positives but predicted results are
negative. True negatives (tn) are those where actual and predicted results are nega-
tives. False positives ( fp) are those where actual results are negatives but predicted
results are positives. A test’s recall is specified as:
Recall =tp
tp + fn
(1)
The test’s precision is specified as:
Precision =tp
tp + fp
(2)
An Empirical Study of CNN-Deep Learning Models for Detection 165
The correctness of the result is also indicated by the F1-Score specified as:
F1-Score = 2 × [Recall × Precision]
[Recall + Precision] (3)
The accuracy of the experiment in terms of confusion matrix parameters can be
specified in (Eq. 4) as follows:
Accuracy =tp + tn
tp + fn + fp + tn
(4)
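A small computational sketch of Eqs. (1)-(4), with the confusion-matrix counts as placeholder inputs:

```python
# Recall, precision, F1-Score, and accuracy from confusion-matrix
# counts, following Eqs. (1)-(4); tp, fn, fp, tn are placeholders.
def confusion_metrics(tp, fn, fp, tn):
    recall = tp / (tp + fn)
    precision = tp / (tp + fp)
    f1 = 2 * recall * precision / (recall + precision)
    accuracy = (tp + tn) / (tp + fn + fp + tn)
    return recall, precision, f1, accuracy

print(confusion_metrics(tp=98, fn=2, fp=0, tn=80))
```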
We also applied certain hyper-parameter settings. For data augmentation,
we considered a rotation range of 15. The Adam optimizer is selected with a
batch size of 8. The number of epochs is set to 10 uniformly for all the comparative models (see the sketch below).
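Expressed in Keras, the stated settings might look as follows; the dataset directory layout and target image size are assumptions, and build_cnn_dlm refers to the earlier sketch.

```python
# Hyper-parameter settings named in the text: rotation range 15 for
# augmentation, Adam optimizer, batch size 8, 10 epochs. The directory
# "cxr_dataset/" and the 224x224 target size are assumptions.
from tensorflow.keras.preprocessing.image import ImageDataGenerator

gen = ImageDataGenerator(rescale=1.0 / 255, rotation_range=15,
                         validation_split=0.2)
train_flow = gen.flow_from_directory(
    "cxr_dataset/", target_size=(224, 224), batch_size=8,
    class_mode="categorical", subset="training")
val_flow = gen.flow_from_directory(
    "cxr_dataset/", target_size=(224, 224), batch_size=8,
    class_mode="categorical", subset="validation")

model = build_cnn_dlm()   # from the earlier sketch
model.fit(train_flow, validation_data=val_flow, epochs=10)
```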
3.2 Comparison of Various CNN-DL Models
We present the confusion matrix results of five pre-trained CNN-DLMs combined with
the proposed model on the selected dataset. The training performance in terms of training
loss, validation loss, and validation accuracy is reported. Table 1 shows the recall,
precision, F1-Score, and accuracy values of the applied pre-trained CNN-DLMs for
the detection of Covid-19 in CXR images.
It is noticed that the proposed model pre-trained with DenseNet121 attained
the top outcomes, with a precision of 100%, recall of 98%, F1-Score of 99%, and
accuracy of 99.45%. Figures 2, 3, 4, 5 and 6 illustrate the training loss and accuracy
plots for the selected CNN-DL models over 10 epochs. Figure 7 illustrates the
results of the proposed model pre-trained with DenseNet121; true value = 0 with
detection = 0 denotes a correctly detected Covid-19 case, whereas true value = 1
with detection = 1 denotes a correctly detected normal patient. The
special structure of the DenseNet121 model enhances data flow across the network
and improves parameter efficacy.
Table 1 Different CNN-DLMs and parameter specifications (values in %)
CNN-DL models     Precision  Recall  F1-Score  Accuracy
MobileNet         100        97.45   98.10     98.89
InceptionNet-V3   98.63      98.30   98.08     98.33
ResNet50          97.87      96.43   97.27     97.77
Xception          100        96.11   98.19     98.61
DenseNet121       100        98.20   99.04     99.45
Fig. 2 Plots of the MobileNet model
Fig. 3 Plots of the InceptionNet-V3 model
Fig. 4 Plots of the ResNet50 model
3.3 Comparison with Existing DL Methods
Here, we compare the best-performing CNN-DLM results with recently developed DL
methods for Covid-19 detection using CXR images, as tabulated in Table 2. It is noted
Fig. 5 Plots of the Xception model
Fig. 6 Plots of the DenseNet121 model
that the proposed work attained better results compared with other existing methods.
Compared to the notable work in [8] and [11], we considered a relatively larger number
of CXR images to test CNN-DLMs. The studies mentioned in [9, 13, 14], and [15]
applied relatively larger datasets to test their models, but those datasets suffered
from class imbalance and a reduced number of Covid-19
CXR images, whereas, in our study, the dataset has an appropriate class distribution of
Covid-19 and normal CXR images. It is also noted that the proposed CNN-DLM
with DenseNet121 as the pre-trained model attains a 100% Covid-19 precision
value.
4 Conclusion
The work aims to develop CNN-DLM-based detection of Covid-19 from chest X-ray
images (CXRIs). A large dataset of CXRIs is considered, and a CNN model is proposed
that is pre-trained with prominent CNN-DLMs. The results indicated that the proposed
CNN-DLM pre-trained with DenseNet121 surpassed other CNN-DLMs with an
Fig. 7 Detection Results for a few CXR Images (DenseNet121)
Table 2 Covid-19 performance comparison with notable work
CNN-DL methods            Covid-19 class precision (%)
Hemdan et al. [8]         100
Narin et al. [11]         96
Ozturk et al. [9]         90.65
Ucar and Korkmaz [13]     100
Farooq and Hafeez [14]    100
Wang and Wong [15]        87.10
Proposed work             100
accuracy of 99.45% for the detection of Covid-19. The other confusion matrix parameters
are also better compared with the other pre-trained CNN-DLMs. Thus, this empirical
study suggests the possible application of DL techniques for the early detection of
Covid-19.
References
1. World Health Organization (2020) Coronavirus disease 2019 (COVID-19) Situation Report–
196 [cited 3 Aug 2020]
2. Rahmani AM et al (2022) Automatic COVID-19 detection mechanisms and approaches from
medical images: a systematic review. Multimed Tools Appl. https://doi.org/10.1007/s11042-
022-12952-7
3. Agrawal T, Choudhary P (2021) FocusCovid: automated COVID-19 detection using deep
learning with chest X-ray images. Evol Syst. https://doi.org/10.1007/s12530-021-09385-2
4. Durrani M, Haq I u, Kalsoom U, Yousaf A (2020) Chest X-rays findings in COVID 19 patients
at a University Teaching Hospital—A descriptive study. Pakistan J Med Sci 36 (COVID19-
S4):S22
5. Sakib S et al (2022) Detection of COVID-19 disease from chest X-ray images: a deep transfer
learning framework. MedRxiv. https://doi.org/10.1101/2020.11.08.20227819
6. Nyemeesha V, Ismail BM (2021) Method to enhance classification of skin cancer using back
propagated artificial neural network applied intelligence and informatics. AII 2021. Commun
Comput Inf Sci 1435
7. Ismail M et al (2019) An effective heart disease prediction method using artificial neural
network. Int J Innov Technol Exploring Eng 8(8):1529–1532
8. Hemdan EED, Shouman MA, Karar ME (2020) COVIDX-Net: a framework of deep learning
classifiers to diagnose COVID-19 in X-Ray Images, arXiv preprint arXiv: 2003.11055
9. Ozturk T, Talo M, Yildirim EA, Baloglu UB, Yildirim O, Rajendra AU (2020) Automated
detection of COVID-19 cases using deep neural networks with X-ray images. Comput Biol
Med 121:103792
10. Apostolopoulos ID, Mpesiana TA (2020) Covid-19: automatic detection from X-ray images
utilizing transfer learning with convolutional neural networks. Phys Eng Sci Med 43(2):635–
640
11. Narin A, Kaya C, Pamuk Z (2021) Automatic detection of coronavirus disease (COVID-19)
using X-ray images and deep convolutional neural networks. Pattern Anal Appl 24(3):1207–
1220
12. Muqeet MA, Quadri MU, Sasidhar K, Krishna PS (2022) Deep learning-based prediction of
nCOVID-19 disease using chest X-ray images (CXRIs). In: Chaurasia MA, Mozar S (eds)
Contactless healthcare facilitation and commodity delivery management during COVID 19
pandemic. Advanced technologies and societal change. Springer, Singapore. https://doi.org/
10.1007/978-981-16-5411-4_3
13. Ucar F, Korkmaz D (2020) COVIDiagnosis-Net: deep Bayes-SqueezeNet based diagnosis of
the coronavirus disease 2019 (COVID-19) from X-ray images. Med Hypotheses 140:109761
14. Farooq M, Hafeez A (2020) COVID-ResNet: a deep learning framework for screening of
COVID19 from radiographs. arXiv preprint arXiv:2003.14395
15. Wang L, Wong A (2020) COVID-Net: a tailored deep convolutional neural network design for
detection of COVID-19 cases from chest X-ray images. arXiv preprint arXiv:2003.09871
16. Sethy PK, Behera SK (2020) Detection of coronavirus disease (COVID-19) based on deep
features. https://doi.org/10.20944/preprints202003.0300.v1
17. Cohen JP, Morrison P, Dao L (2020) COVID-19 image data collection. https://github.com/iee
e8023/Covid-chestxray-dataset
18. Shi F et al (2020) Review of artificial intelligence techniques in imaging data acquisition,
segmentation and diagnosis for COVID-19. https://arxiv.org/abs/2004.0273
19. Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional
neural networks. In: Advances in neural information processing systems, pp 1097–1105
20. Sandler M, Howard A, Zhu M, Zhmoginov A, Chen L-C (2018) MobileNetV2: Inverted resid-
uals and linear bottlenecks. In: Proceedings of the IEEE conference on computer vision and
pattern recognition, 2018, pp 4510–4520
21. Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception archi-
tecture for computer vision. In: Proceedings of the IEEE conference on computer vision and
pattern recognition, 2016, pp 2818–2826
22. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition, in:
Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp
770–778
23. Chollet F (2017) Xception: deep learning with depth wise separable convolutions. In
Proceedings of the IEEE conference on computer vision and pattern recognition 2017, pp
1800–1807
24. Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional
networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition,
Honolulu, 21–26 July 2017, pp 4700–4708
25. Muqeet MA, Holambe RS (2018) A collaborative representation face classification on separable
adaptive directional wavelet transform based completed local binary pattern features. Eng Sci
Technol, Int J 21(4)
Detection of Eye Blink Using SVM
Classifier
Varaha Sai Adireddi, Charan Naga Santhu Jagadeesh Boddeda,
Devi Shanthisree Kumpatla, Chris Daniel Mantri, B. Dinesh Reddy ,
G. Geetha, N. Thirupathi Rao , and Debnath Bhattacharyya
Abstract The eyes are the most important feature of our bodies because they allow
us to see and explore the world. Nowadays, technology is continually evolving,
paving the way for greater development and increased use of gadgets by everyone.
When people stare at digital screens for long periods of time, they develop eye strain
and visual issues, which is known as computer vision syndrome (CVS). The best way
to avoid visual problems caused by digital screens is to take appropriate preventive
measures such as getting regular eye care. To protect users from eye disorders, we
created a model that uses the Viola–Jones method and an SVM classifier to estimate
the user's eye blinking ratio. The proposed approach locates facial landmarks of
significance, and a scalar parameter, the eye aspect ratio (EAR), is derived that
characterizes the eye opening in each frame. Finally, within a limited temporal window,
eye blinks are recognized as a pattern of EAR values using an SVM classifier. The
user can be notified about his gadget usage based on the resulting eye blink ratio
and gradually diminish his addiction to digital screens that affect his eyes.
Keywords Computer vision syndrome · Eye blink · Eye aspect ratio · Viola–Jones algorithm · Facial landmarks · Support vector machine
V. S. Adireddi (B) · C. N. S. J. Boddeda · D. S. Kumpatla · C. D. Mantri · B. D. Reddy ·
N. T. Rao
Department of Computer Science and Engineering, Vignan’s Institute of Information Technology
(A), Visakhapatnam, Andhra Pradesh, India
e-mail: varahasaiadireddi@gmail.com
G. Geetha
Department of Information Technology, VR Siddhartha Engineering College, Kanuru,
Vijayawada, Andhra Pradesh, India
D. Bhattacharyya
Department of Computer Science and Engineering, Koneru Lakshmaiah Education,
Vaddeswaram, Guntur, Andhra Pradesh 522502, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_18
1 Introduction
Computers have become an inextricable aspect of our lives. People today increasingly
spend hours in front of visual technologies such as personal digital assistants,
computer monitors, and televisions. One of the adverse consequences of this trend
is computer vision syndrome (CVS), often known as digital eye strain. CVS is a
collection of visual and eye disorders caused by long periods of sitting in front of
computers. According to a recent poll, India's computer-using population is above
20 million, with 80% of them suffering from CVS. CVS symptoms affect 25–93%
of computer users. Researchers have written about CVS in medical and technical
journals. The task's visual demands frequently exceed an individual's visual capacity
to perform them comfortably, resulting in CVS symptoms. Those who spend two or more
hours every day in front of a computer or a digital screen gadget are most likely
to develop CVS. CVS lowers the blink rate (number of blinks per unit of time), which
causes the majority of the difficulties. As a result, the best CVS approach might
be to concentrate on preserving the user's nominal blink rate on an average time
scale. This is the driving force behind this project. Blink detection has also been
studied to reduce accidents in driver awareness/alertness systems, and blinks are
used in eye-based computer interaction and communication. Non-intrusive methods,
such as using a camera to detect eye blinks, have been the subject of recent research.
Despite promising results, they face challenges with blink detection accuracy, eye
shape variation, and user mobility. In this study, we provide a new approach for
identifying eye blinks based on EAR features classified with an SVM, which
accurately gives the eye blink ratio so that the user can quickly assess his eye
condition and reduce the use of digital screen devices.
2 Related Works
As mentioned earlier, several papers have been produced on eye blink detection
concepts, and we have reviewed several technologies mentioned in the literature. The
concept of feature selection is used to extract facial landmarks from images using a
pre-trained model that extracts the face from an image with a (Haar) cascade function.
The detection framework repeatedly computes image pixel sums within rectangular
regions. In this respect the features resemble Haar basis functions, which have been
employed in image-based object detection in the past. On the other hand, they are
more intricate, because they rely on more than one rectangular area. The value of
each feature is obtained by subtracting the sum of the pixels inside clear rectangles
from the sum of the pixels inside shaded rectangles. Compared to alternatives
like steerable filters, such rectangular features are rudimentary: they respond to
horizontal and vertical elements, but their response is significantly coarser [14].
One approach uses intensity vertical projection (IVP) to detect eye blinks,
estimating the total intensity of object pixels in each row. The assumption is that
the brow and iris parts are darker than the skin, so two IVP local minima represent
their centers, and the maximum between them marks the center of the skin area
between the eyebrow and the iris. Because the skin area expands when the eyes are
closed, the minimum then lies at the middle of the eyelids when the eye is closed
[59]. Another proposal is a 3-stage method: the first stage introduces an integral
image that serves as an intermediate image; second, the AdaBoost method is used to
extract key characteristics; in the final stage, all complex features are integrated
to obtain a clear face detection. The authors then employed eye pair detection,
which detects the eyes using golden ratios and produces templates from the frame.
After the eye tracking template matching is accomplished, the eyes are tracked with
great accuracy [10].
Eye blinking can also be detected via motion analysis. A quick eye tracking process
preserves precise information about the eye's appearance after initial localization,
and motion analysis provides incredibly precise facts regarding the eyes' locations
when blink detection is used. As a result, updating the region of interest (ROI)
centered around the eye requires only a basic tracking algorithm; the system uses
the normalized correlation coefficient. Based on the user's eye blink rate, the eye
protection program assesses whether or not the user should rest their eyes. It
accurately recognizes the eyeballs from a variety of viewing angles and lighting
situations.
3 Materials and Methods
3.1 Face Detection
In the first stage, the Viola–Jones face detector is used to recognize the face. It is
a real-time object detection approach with a high detection rate, and it has three
essential steps. First, an intermediate image in the form of an integral image
is introduced to speed up feature extraction by employing pixel sums rather than
raw rectangular features, which are regarded as slow. Second, the AdaBoost technique
is utilized to extract key characteristics from a large amount of data, yielding a
highly accurate detector. Third, the resulting classifiers are combined in a cascade
for efficient detection (Fig. 1).
3.2 Eye Pair Detection
Following the detection of the face, the eyes are detected in the second step. The
eye pair is located in the upper area of the face. The iBUG
300-W dataset, which comprises 68 face landmarks, was used to train the dlib facial
Fig. 1 Facial landmarks
Fig. 2 Eye facial landmark
points
landmark predictor. Each eye is then indicated by 6 of these landmark points (Fig. 2).
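A minimal sketch of computing the eye aspect ratio from the six eye landmark points, using the standard EAR definition (the chapter does not spell out the formula):

```python
# EAR from the six dlib landmark points p1..p6 of one eye:
# two vertical distances over twice the horizontal distance.
from scipy.spatial import distance as dist

def eye_aspect_ratio(eye):
    # eye: sequence of six (x, y) points, ordered p1..p6 around the eye
    a = dist.euclidean(eye[1], eye[5])  # vertical distance p2-p6
    b = dist.euclidean(eye[2], eye[4])  # vertical distance p3-p5
    c = dist.euclidean(eye[0], eye[3])  # horizontal distance p1-p4
    return (a + b) / (2.0 * c)
```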
3.3 Classification Using SVM Model
The SVM model is trained on the Eyeblink8 dataset, which consists of eight videos
of four individuals. Many non-blink motions, natural face movements, and facial
mimics are present. There are over 82,600 frames (640 × 480) in the collection,
with 353 blinks. The dataset distinguishes between three states: open, half, and
closed. When a blink begins, individual frames are assigned half tags until the eye
is entirely closed; the term "fully closed eye" refers to the eyelid covering
90–100% of the eye. Closed is used to tag entirely closed eyes, whereas half is also
used for opening eyes until they are fully open, so the scheme can likewise mark eye
blinks that never fully close. The tag left/right is added to the eye state if only
one eye is visible.
As a result, we present a classifier that uses a frame's broader temporal window
as input. In 30 fps recordings, we discovered that the three frames on each side
can have a significant impact on blink recognition for the frame where the eye is most closed
during blinking. Concatenating a frame's EAR with the EARs of its three preceding
and three following frames yields a 7-dimensional feature for that frame.
This 7-dimensional window is the data frame given as input to the SVM. Based on it,
the SVM classifier assigns yk = 1 if at least one of the frames in the 7-dimensional
window is totally closed, and yk = 0 if all of the frames in the same window are half
or open.
Manually labeled sequences y were used to train the linear SVM classifier. For
each frame, except the three frames at the beginning and end of a video sequence,
a 7-dimensional feature x1, …, x7 is generated (using the EAR measure) and classified
by SVM.
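A hedged scikit-learn sketch of this windowing and training step; the chapter does not name a library, and the EAR sequence and labels below are placeholders:

```python
# Build 7-dimensional EAR windows and train a linear SVM on them.
# `ears` and `labels` are placeholders standing in for one video's
# per-frame EAR values and manual 0/1 frame labels.
import numpy as np
from sklearn.svm import LinearSVC

def ear_windows(ears, width=3):
    # Each feature concatenates a frame's EAR with the EARs of its
    # `width` preceding and following frames (7-dimensional here).
    return np.array([ears[k - width:k + width + 1]
                     for k in range(width, len(ears) - width)])

rng = np.random.default_rng(0)
ears = rng.random(1000)              # placeholder EAR sequence
labels = (ears < 0.15).astype(int)   # placeholder closed-eye labels

X = ear_windows(ears)
y = labels[3:-3]                     # align labels with the windows
clf = LinearSVC().fit(X, y)
print(clf.score(X, y))
```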
Given the enormous number of 0s in y, the idea was to balance the dataset by
picking a random sample of 0s equal to the number of 1s for each video. The end
result was a dataset with 5900 rows that was randomly split into two sets: training
(80% of the observations) and test (20% of the observations). The two sets have an
even distribution of 0s and 1s: the training set has 2348 1s across 4720 units, while
the test set has 602 1s over 1180 units.
To avoid differing EAR scales when comparing multiple videos, a normalization
was performed first for each video (before sampling the 0's and 1's) and then for the
training set. It should be mentioned that the SVM detects the presence of closed eyes
in the 7-dimensional window with 95% accuracy. As a result, when a blink happens,
the SVM prediction consists of a sequence of seven 1's between 0's, assuming that
the classifier is never erroneous and that the eyes are closed for only one frame. This
sequence must be reduced to a single blink. It is vital to remember that both false
positives and false negatives can cause SVM predictions to be incorrect (Fig. 3).
After considerable experimentation, we discovered that the optimal rule was to
first smooth the predicted sequence and then turn each remaining run of 1's into a
single blink. The runs of consecutive 1's and 0's are treated as follows:
Fig. 3 Finding the EAR values using SVM classifier
Table 1 Comparing results between SVM and EAR thresholding (OpenCV)
          SVM                        OpenCV
          Precision (%)  Recall (%)  Precision (%)  Recall (%)
Video 1   43             81          7              12
Video 2   100            23          0              0
Video 3   58             89          7              33
0's: single (…, 1, 1, 0, 1, 1, …), double (…, 1, 1, 0, 0, 1, 1, …), and triple (…, 1,
1, 0, 0, 0, 1, 1, …) 0's in between runs of consecutive 1's were identified as
misclassified and changed to 1's.
1's: single (…, 0, 0, 1, 0, 0, …) and double (…, 0, 0, 1, 1, 0, 0, …) 1's between
runs of consecutive 0's were identified as misclassified and changed to 0's.
A blink is detected after this transition wherever a sequence of 1's is found, as sketched below.
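A sketch of the described post-processing, in which short interior runs are treated as misclassifications and each remaining run of 1's is counted as one blink; the run-length thresholds follow the rules above:

```python
# Smooth an SVM prediction sequence, then count blinks: interior runs
# of up to three 0's become 1's, interior runs of up to two 1's become
# 0's, and each remaining maximal run of 1's counts as one blink.
def count_blinks(pred):
    y = list(pred)
    n = len(y)
    i = 0
    while i < n:
        j = i
        while j < n and y[j] == y[i]:
            j += 1                      # [i, j) is a run of equal values
        run = j - i
        if 0 < i and j < n:             # only flip interior runs
            if y[i] == 0 and run <= 3:
                y[i:j] = [1] * run
            elif y[i] == 1 and run <= 2:
                y[i:j] = [0] * run
        i = j
    blinks, prev = 0, 0
    for v in y:
        if v == 1 and prev == 0:
            blinks += 1                 # start of a run of 1's
        prev = v
    return blinks

print(count_blinks([0, 0, 1, 1, 1, 0, 1, 1, 1, 1, 0, 0, 0, 0]))  # -> 1
```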
4 Results and Discussions
The performance was assessed not only in terms of the number of correctly
categorized blinks (true positives), but also in terms of the significant occurrences
in which a video frame was misclassified as a blink (false positives) or a blink was
missed (false negatives). Two metrics have been generated based on
these data.
After being validated on multiple videos from the iBUG 300-W dataset, the SVM
blink detector was compared against the OpenCV blink detector. The goal of this
step was to see how well the two systems could detect blinks despite differences in
stance, expression, lighting, backdrop, occlusion, and image quality. The EAR SVM
experiment is conducted across multiple datasets: the Eyeblink8 dataset is used to
train the SVM classifier, which is then evaluated on the Talking dataset (Table 1).
We believe these results are relevant and that the SVM classifier is far better than
the OpenCV model (Fig. 4).
We find that EAR thresholding falls behind the EAR SVM classifier on this tough
database. The thresholding fails when a subject grins, looks to the side, or closes
his or her eyes for longer than a blink duration. The SVM detector greatly outperforms
the EAR thresholding approach.
5 Conclusion
A real-time technique for identifying eye blinks was presented. We showed
quantitatively that landmark-based facial feature detectors are accurate enough to
estimate eye openness consistently. Under all tough conditions, the OpenCV blink
Fig. 4 Comparative analysis of SVM and OpenCV models
detector performs poorly. When dealing with facial emotions and position
fluctuations, the OpenCV detector has a number of issues, including misclassifying
blinks under random settings. On two standard datasets, we developed a model that
employs a state-of-the-art robust landmark detector, followed by a basic SVM-based
eye blink detector. Because the additional processing costs of eye blink detection
are modest compared to the real-time landmark detector, the algorithm works in
real time. The suggested SVM approach outperforms EAR thresholding by using a
temporal window of the eye aspect ratio (EAR). In the event that a lengthier sequence
is not available, the thresholding can be used as a single-image classifier to detect
the eye state. In the future, we can deploy this classifier with the front camera of an
Android device to calculate the activity time of the user and notify him easily.
References
1. Sandberg D et al (2011) The characteristics of sleepiness during real driving at night—a study
of driving performance physiology and subjective experience. Sleep 34(10):1317
2. Lin CT, Chang CJ, Lin BS, Hung SH, Chao CF, Wang IJ (2010) A real-time wireless
brain—computer interface system for drowsiness detection. IEEE Trans Biomed Circuits Syst
4(4):214–222
3. Lin CT, Chen YC, Huang TY, Chiu TT, Ko LW, Liang SF et al (2008) Development of wireless
brain computer interface with embedded multitask scheduling and its application on real-time
driver’s drowsiness detection and warning. IEEE Trans Biomed Eng 55(5):1582–1591
4. Bhattacharyya D, Doppala BP, Thirupathi Rao N (2020) Prediction and forecasting of persistent
kidney problems using machine learning algorithms. Int J Current Res Rev 12(20):134–139.
https://doi.org/10.31782/IJCRR.2020.122031
5. Mandhala VN, Bhattacharyya D, Vamsi B, Thirupathi Rao N (2020) Object detection using
machine learning for visually impaired people. Int J Current Res Rev 12(20):157–167. https://
doi.org/10.31782/IJCRR.2020.122032
6. Picot A, Charbonnier S, Caplier A (2008) On-line automatic detection of driver drowsiness
using a single electroencephalographic channel. In: Proceedings of 30th annual international
conference of the engineering in medicine and biology society 2008. EMBS IEEE
7. You C-W et al (2012) CarSafe: a driver safety app that detects dangerous driving behavior using
dual-cameras on smartphones. In: Proceedings of the 2012 ACM conference on ubiquitous
computing
8. Kurian D, Johnson Joseph PL, Radhakrishnan K, Balakrishnan A (2014) Drowsiness detec-
tion using photoplethysmography signal. In: Fourth International conference on advances in
computing and communications (ICACC), pp 73–76
9. Sahayadhas A, Sundaraj K, Murugappan M (2013) Drowsiness detection during different times
of day using multiple features. Australas Phys Eng Sci Med 36(2):243–250
10. Pal NR et al. (2008) EEG-based subject-and session-independent drowsiness detection: an
unsupervised approach. EURASIP J Adv Signal Process
A Novel Approach for Health Analysis
Using Machine Learning Approaches
Debdatta Bhattacharya , N. Thirupathi Rao , K. Asish Vardhan,
and Eali Stephen Neal Joshua
Abstract Data mining and big data are today's leading technologies. These
techniques are applied in the banking sector, health services, cyber-security,
voting, insurance, real estate, etc., and here to diabetes. Diabetes is a chronic
metabolic disease in which the level of blood glucose in the body is elevated,
either because the production of insulin is unsatisfactory or because the body's
cells do not react properly to insulin. Persistently high blood sugar in diabetes is
notorious for causing long-term injuries and complications in the kidneys, heart,
blood vessels, nerves, and eyes in particular. The main purpose of this work is
therefore to analyze the data, build a predictive model using machine learning
techniques, and position the classification model as close as possible to the medical
outcome. The system mainly selects the features indicative of diabetes mellitus for
early detection in predictive studies. Across the evaluated algorithms, random
forest and the decision tree display the greatest distinguishability, of 97.20% and
97.30%. Naive Bayes achieves an optimal precision outcome of 85.43%. Similarly,
the study provides a summary of the model highlights selected to develop the data
collection precisely.
Keywords SVM · Diabetes · Naive Bayesian · Random forest · Data mining · Big data · Machine learning · Deep learning
D. Bhattacharya (B)
Department of Computer Science and Engineering, Koneru Laksmaiah Education Foundation,
Vaddeswaram, Guntur, Andhra Pradesh 522302, India
e-mail: debdatta122001@gmail.com
N. T. Rao · E. S. Neal Joshua
Department of Computer Science and Engineering, Vignan’s Institute of Information Technology,
Visakhapatnam, Andhra Pradesh 530016, India
K. Asish Vardhan
Department of Computer Science and Engineering, Bullayya College of Engineering for Women,
Visakhapatnam, Andhra Pradesh, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_19
1 Introduction
The World Health Organization's annual statement indicates that the number of
people encountering diabetes is about 434 million, and important additions to this
number are reported constantly at a variety of healthcare centres. The World Health
Organization (WHO) statements and the "Standards of Medical Care in Diabetes
2018" by the American Diabetes Association [1, 2] provide an assessment of
various aspects of the disease and their management. Figure 1 depicts, for people
aged somewhere in the range of 30–80 years, the varying levels of diabetes and
the parts of the body it affects.
Diabetes mellitus [3] is a chronic difficulty whose reason lies in high sugar levels
in the circulatory system, caused by the erroneous functioning of the pancreatic
beta cells. This affects different parts of the body, with complications that
incorporate pancreas problems, kidney failures, hypertension, foot issues, nerve
hurt, threat of heart diseases, eye issues, ketoacidosis, glaucoma, visual
disturbances and cataracts, etc. The underlying causes relate to a man's standard of
living: significant cholesterol (hyperlipidaemia), deficiency of movement, smoking,
stress, nutrition propensities, high blood sugar (hyperglycaemia), and so on, which
greatly add to the risk of developing fundamental levels of diabetes. These impacts
occur across a large scale of ages, from young people to mature and developed
individuals.
The pancreas [4] is an organ set in the abdominal region. It has two necessary
functions: the first is the endocrine function and the second is the exocrine
function. The endocrine part keeps up the regulation of the sugar level in the
circulatory structure, while the exocrine part helps digestion. Malfunction of the
pancreas has impacts on different
Fig. 1 Finding the diabetes levels and their ranges, consolidating the different pieces of the
body affected by diabetes
parts of the body and causes different insufficiencies [4]. The regulation of sugar
levels in the body likewise plays a fundamental role in diabetes.
Diabetic retinopathy is a failure in which the retina and the optic nerve are
injured. As an after-effect, night-time vision debilitation and reduced visual
awareness may occur as the affected retina area increases. A series of tests at the
initial stages [5] should be used to monitor the diabetic individual's eye vision
during treatment. The tests consolidate visual unevenness testing, optical coherence
tomography (OCT), and others. The treatment joins diverse medicines:
corticosteroids, middle/traverse piece macular laser restorative methodology, and
anti-VEGF injection.
2 Related Works
Diabetes is not an infectious illness, but it prompts extensive long-term
inconveniences and real health issues. Proof comes from the World Health
Organization, which expresses that the complexities of diabetes burden a person
mentally, physically, and economically in excess of their means. Analyses report
about 2.1 million deaths because of unrestrained health conditions progressing
toward death, and about 3.5 million deaths [6] happened because of the danger
mechanisms of diabetes, such as congestive heart failure, together with dissimilar
maladies.
Diabetes is nothing but a sickness whose reason is the fixation of high sugar
levels in the blood. In the literature, among the dissimilar analyses examined, an
expressively encouraging classification is recommended utilizing the AdaBoost
computation with the decision tree as the base classification method. Likewise,
support vector machine, Naive Bayes, and decision tree have been associated as
the base techniques for the AdaBoost calculation for accuracy verification.
AdaBoost with the decision tree as base classifier attains an accuracy of 90.56%,
notably better in relation to that of the standalone decision tree, support vector
machine, and Naive Bayes. An additional impact of artificial intelligence is
machine learning, which creates estimators arranged to derive models and decision
rules from information.
Artificial intelligence (AI) algorithms have been embedded into a data mining
pipeline, which can be set up alongside established medical methods to derive
insight from facts. In the EU-financed MOSAIC endeavor, a data mining pipeline
has been used to derive a set of predictive models of Type-2 diabetes mellitus
(T2DM) [7] complications from the electronic health record data of approximately
1000 patients. The pipeline incorporates medical center profiling, predictive model
targeting, predictive model improvement, and model validation. Missing records
were handled through random forest (RF) techniques, associated with suitable
techniques to control the asymmetric classes, and a logistic regression model was
used for the prediction of the onset of nephropathy and retinopathy, in different
time scenarios, at 5, 7, and 9 years from the first hospital visit for a diabetes
check-up. Measured essentials include gender, age, time from diagnosis, body
mass index (BMI), glycated hemoglobin, smoking habit, and hypertension. The
prediction techniques, tailored according to the complexities, surrendered an
accuracy up to 0.838. Various fundamentals were selected for each complication
and time condition, producing exact models easy to interpret in medical
observation.
One article examines the Pima Indian dataset using various classification
measures: logistic regression, ZeroR, random forest, Naïve Bayes, J48, and MLP,
with the expectation of classifying diabetes as positive or negative. To test for
diabetes, a data mining tool, i.e., the WEKA tool [8], can be used; as far as
correctness and execution are concerned, MLP is superior. Another proposed
procedure uses SVM, an AI strategy, as the classifier for the examination of
diabetes. The AI technique revolves around classifying diabetes sickness
commencing from a dataset. The proposed system is assessed by classification
exactness, the k-fold cross-validation technique, as well as the confusion matrix.
The reported classification precision of 93.10% is extraordinarily high in relation
to the previously detailed classification techniques.
3 Materials and Methods
3.1 Decision Tree
The decision tree is a classification technique utilized for classification problems;
it derives a decision model that divides the data in a dataset. The technique
analyzes the informational content and builds a decision model to predict the
unknown group labels. The classification technique is able to handle both discrete
and continuous factors. The decision tree preferably selects as root node the
attribute with the highest randomness (entropy) value. The decision tree provides
an ideal arrangement for selection, and it makes few assumptions about the training
dataset. The input of the decision tree [9] is a set of information comprising some
attributes and instances with values for the decision model. Problems confronted
while building the decision model include choosing the splitting attribute, the split
points, stopping criteria, pruning, the quality and quantity of the training and test
sets, the order of splits, and so forth.
The input is the training dataset.
The output is a decision model built in the structure of a tree.
The tree structure of a decision model incorporates a collection of structured
nodes: decision nodes (which divide into further nodes) and leaf nodes. The
architecture of the decision tree is depicted in Figs. 3, 4 and 5. The dataset has
dissimilar attributes, and the accurate selection of attributes for the root node is a
complex job. Each decision node has at least two branches. The initial node acts
as the main node and is called the root node. The structure identifies the greatest
feature as the initial node, otherwise the best-splitting node, from the available
collection of nodes. The technique has several approaches for selecting the finest
root-node attribute [10], in view of the impurity levels weighted over the children
nodes; these measure the performance of the classification technique via
information gain, the Gini index, and the classification error. These calculations
are accomplished for all the attributes, and a comparison is completed for choosing
the optimal split, as the sketch below illustrates.
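A brief scikit-learn sketch of decision tree classification with the Gini index as the split criterion; the feature matrix and labels are placeholders standing in for the diabetes data:

```python
# Decision tree classification using the Gini impurity criterion;
# X and y are placeholders for the diabetes attributes and labels.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.random((100, 6))        # 6 attributes, as in Table 1
y = rng.integers(0, 2, 100)     # diabetic / non-diabetic labels

tree = DecisionTreeClassifier(criterion="gini").fit(X, y)
print(tree.predict(X[:5]))
```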
3.2 Naïve Bayesian
The Naïve Bayes is a classification technique based on a probability analysis that
depends on Bayes' theorem [11], shown in Eqs. (1) and (2), under self-ruling
(independent) predictions of the hypothesis. The technique takes a dataset as input
and performs the analysis, predicting the group label by Bayes' theorem. It measures
the possibility of the input data belonging to a group by computing the probability
of the anonymous data samples in the group, and it is appropriate for huge datasets.
The formula given below is the Naïve Bayes formula, which is used for the
calculation of the posterior probability of all groups. The flowchart of the Naïve
Bayes technique is shown in Fig. 1.
Q(b|y) = Q(y|b) Q(b) / Q(y)    (1)

Q(b|y) = Q(y1|b) Q(y2|b) … Q(yn|b) Q(b)    (2)
Q(b|y) is the posterior probability of the group (goal) given the predictor (element).
Q(b) is the prior probability of the group.
Q(y|b) is the likelihood, the probability of the predictor in a known group.
Q(y) is the prior probability of the predictor.
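A tiny sketch of the posterior computation of Eqs. (1) and (2) for continuous attributes, using a Gaussian Naive Bayes model; scikit-learn and the placeholder data are assumptions, not part of the chapter.

```python
# Naive Bayes posterior Q(b|y) per Eqs. (1)-(2), with Gaussian
# likelihoods; X and y are placeholder data standing in for the
# diabetes attributes and group labels.
import numpy as np
from sklearn.naive_bayes import GaussianNB

rng = np.random.default_rng(1)
X = rng.random((100, 6))
y = rng.integers(0, 2, 100)

nb = GaussianNB().fit(X, y)
print(nb.predict_proba(X[:3]))   # posterior probability per group
```

Support Vector Machine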
The support vector machine is a classification algorithm with a discriminative
arrangement approach, used for both classification and regression. The justification
is done among the datasets: the SVM discovers a separating line (hyperplane) by
which the dataset can be partitioned into two classes in the best way, as depicted
in Fig. 2. This incorporates two stages: the observation of the best, otherwise
perfect, separating line in the information space, along with the margins determined
by the mapping of the objects. The technique constructs the representation of a
model, which allocates classes for the latest examples.
Fig. 2 Data allocation of the support vector machine under the separating line (hyperplane)
3.3 Random Forest
The random forest is a classification algorithm mostly used for classification
problems. This supervised learning algorithm is also used for both classification
and regression. It builds a collection of decision trees from samples of the dataset,
by which the data can be divided into two classes, as depicted in Fig. 5.
Consider the information comprising m highlights (features) talking to the behavior
of the dataset. The preparing computation of the random forest is known as the
bootstrap calculation, otherwise the bagging method: randomly select n highlights
from the m highlights to build the arbitrary trees, and estimate the out-of-bag
(OOB) fault on the samples not used for training. Determine each node d utilizing
the greatest divide (best split); mainly, the sub-nodes are divided from the main
nodes. These steps are repeated to find out the n number of trees.
The random forest technique makes decision trees on dataset samples, takes the
prediction from each of them, and lastly chooses the top answer through means of
voting, as the sketch below shows.
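A compact scikit-learn sketch of the bagging procedure with an out-of-bag (OOB) error estimate, mirroring the steps above on placeholder data:

```python
# Random forest with bootstrap sampling and OOB scoring; X and y
# are placeholder data for the diabetes attributes and labels.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(2)
X = rng.random((200, 6))
y = rng.integers(0, 2, 200)

rf = RandomForestClassifier(n_estimators=100, bootstrap=True,
                            oob_score=True).fit(X, y)
print("OOB accuracy:", rf.oob_score_)   # 1 - OOB error
```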
3.4 K-Nearest Neighbor (KNN)
Customized Method.
The customized methodology incorporates the selection of the accurate properties
from the huge information base, for the clarification of dataset problems affected
by classification. Mainly, each problem contains an accurate/perfect behavior,
which is obtained by an individual analysis that overlooks dispensable properties.
Table 1 Dataset characteristics and their explanation
S. No.  Characteristics            Explanation
1       Length of life (age)       The time of human life
2       Sex                        Female or male
3       Blood glucose (diet)
4       Blood glucose (position)
5       Carrying (pregnancy)       Carrying count for ladies
6       Level of blood glucose     To test the glucose level in the blood
The dataset depicted in Table 1 incorporates a variety of properties and their
illustration. Selection of the exact attributes adheres to the feature information of
the dataset, so that the outcomes obtained by grouping can be typical. This
methodology incorporates five stages.
(1) The type of problem is expressed by its qualities as well as properties.
(2) Transmission and dataset collection: the attributes run from mj = 0 to ml =
maximum, where maximum is nothing but the number of properties; moreover,
attribute J has quality one (the central driver).
(3) Representation of diabetes: the intensity of the sugar qualities expresses the
type of persons experiencing diabetes.
(4) If an attribute has value 1, it is considered as input (the explanation of the
major properties is accountable); the process then provides the relation to an
additional feature m, by which the values are produced.
(5) The output is the selection of attributes, by which this type of classification
outcome can be enhanced (Table 1).
Correlation value = Attribute(a_x) / Σ_{i=1}^{n} Attribute(x_i)
The method proceeds over the various properties: the principal values are contrasted
for every feature (attribute), and in the event that the value dissimilarity is more
than that of another attribute, that feature has less importance; for example,
value 1 is contrasted with value n. The top distinctive features are selected and
arranged in descending demand, and the resulting model highlights of the dataset
are used for classification events, as sketched below.
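One plausible reading of this attribute ranking, sketched on placeholder data: attributes are scored by the magnitude of their correlation with the class label and arranged in descending order.

```python
# Correlation-based attribute ranking; the exact scoring in the
# chapter is ambiguous, so correlation with the class label is one
# plausible interpretation. X and y are placeholders.
import numpy as np

rng = np.random.default_rng(3)
X = rng.random((100, 6))                  # 100 samples, 6 attributes
y = rng.integers(0, 2, 100)               # diabetic / non-diabetic

corr = [abs(np.corrcoef(X[:, j], y)[0, 1]) for j in range(X.shape[1])]
ranking = np.argsort(corr)[::-1]          # most correlated first
print("attribute ranking:", ranking)
```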
4 Results and Discussions
The performance review of the classification systems is prepared using different
evaluation procedures, for example, accuracy, sensitivity, specificity, correctness,
and analysis. Our study centres on five collections of measures, namely decision
tree, SVM, Naïve Bayes, KNN, as well as random forest. Table 2 depicts the
outcomes allocated to the supervised learning methods. Our testing is conducted
using the RapidMiner classification tool (an illustrative equivalent is sketched below).
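Although the reported results come from a RapidMiner workflow, an equivalent comparison of the five classifiers can be sketched in scikit-learn on placeholder data:

```python
# Comparing the five classifiers of Table 2; the authors used
# RapidMiner, so this scikit-learn version is only an illustrative
# equivalent on placeholder data.
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import SVC
from sklearn.ensemble import RandomForestClassifier
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(4)
X = rng.random((500, 6))
y = rng.integers(0, 2, 500)

models = {"Naive Bayes": GaussianNB(),
          "Decision tree": DecisionTreeClassifier(),
          "SVM": SVC(),
          "Random forest": RandomForestClassifier(),
          "KNN": KNeighborsClassifier()}
for name, clf in models.items():
    acc = cross_val_score(clf, X, y, cv=5).mean()
    print(f"{name}: {acc:.4f}")
```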
SVM: This supervised learning method is applied on the clinical dataset. The
classification method has an accuracy of 61.45%; the detailed outcomes are
displayed in Table 3.
Random forest: This classification method has an accuracy of 68.46%. The
outcomes are displayed in Table 2. The illustration of the tree structure, whose
distinctiveness depends on dissimilar circumstances, is given in Fig. 7.
Naive Bayes: The correctness is 70.67%. The outcomes are displayed in Table 2.
Decision tree: The accurateness value is 82.34%. The outcomes are displayed in
Table 2. The tree picture is displayed in Fig. 2.
Table 2 Method of classification results
S. No  Method of classification  Accuracy (%)  Correctly classified  Incorrectly classified
1      Naïve Bayes               70.67         218                   72
2      Decision tree             82.34         653                   315
3      Support vector machine    61.45         685                   262
4      Random forest             68.46         688                   276
5      K-nearest neighbor        79.41         232                   94
Table 3 Outcomes for support vector machine (accuracy = 61.45%)
                         Non-diabetic (actual)  Diabetic (actual)  Class precision (%)
Predicted non-diabetic   268                    65                 81.34
Predicted diabetic       223                    333                80.67
Class recall (%)         67.34                  90.72
5 Conclusion and Future Scope
Diabetes symbolizes a miscellaneous (heterogeneous) collection of illnesses,
represented by the fact that the blood contains excess glucose. The primary goal of
the American Diabetes Association is "to prevent and secure diabetes and to
develop lives that are astonishingly unaffected by diabetes." As primary assistance
for individual lives all over the world, predictive analysis can be used to classify
as well as stop the inconveniences of diabetes at the start, and to improve the
arrangement (classification) systems. The proposed system also performs the
investigation of the highlighted datasets as well as the selection of ideal highlights
based on reputable relationships and relational correlations. The two algorithms
with the highest accuracy values are random forest and decision tree, which attain
99.11% and 99.62%, respectively. Individual investigation is the best technique
for collecting diabetic information. The Naive Bayes and support vector machine
procedures provide exactness values of 90.34% and 76.91%, respectively, with the
current strategy. As a result, the proposed technique is used to improve the precision
of the grouping systems. The accuracy of the improved support vector machine is
82.53%, and the precision of the Naive Bayes is 81.59%; thus, with this technique,
results from low dimensions up to high measurements were successfully obtained.
This provides accurate information for the patient records of both diabetic and
non-diabetic patients, so that the disease's frequency rate can be predicted.
References
1. Global Report on Diabetes 2016 by World Health Organisation. ISBN 978-924-1565257. http://
www.who.int/diabetes/publications/grd-2016/en/
2. Chandra Sekhar P, Thirupathi Rao N, Bhattacharyya D, Kim T (2021) Segmentation of natural
images with k-means and hierarchical algorithm based on mixture of Pearson distributions. J
Sci Ind Res 80(8):707–715. Retrieved from www.scopus.com
3. Yıldırım Ö, Pławiak P, Tan RS, Acharya UR (2018) Arrhythmia detection using deep
convolutional neural network with long duration ECG signals. Comput Biol Med 102:411–420
4. Uddin MZ, Dysthe KK, Følstad A et al (2022) Deep learning for prediction of depressive
symptoms in a large textual dataset. Neural Comput Appl 34:721–744. https://doi.org/10.1007/
s00521-021-06426-4
5. Mandhala VN, Bhattacharyya D, Vamsi B, Thirupathi Rao N (2020) Object detection using
machine learning for visually impaired people. Int J Current Res Rev 12(20):157–167. https://
doi.org/10.31782/IJCRR.2020.122032
6. Bhattacharyya D, Kumari NMJ, Joshua ESN, Rao NT (2020) Advanced empirical studies on
group governance of the novel corona virus, Mers, Sars and Ebola: a systematic study. Int J
Current Res Rev 12(18):35–41. https://doi.org/10.31782/IJCRR.2020.121828
7. Donahue J et al (2015) Long-term recurrent convolutional networks for visual recognition and
description. In: Proceedings of the IEEE conference on computer vision and pattern recognition
(CVPR)
8. Eali SNJ, Bhattacharyya D, Nakka TR, Hong S (2022) A novel approach in bio-medical image
segmentation for analyzing brain cancer images with U-NET semantic segmentation and TPLD
models using SVM. Traitement Signal 39(2):419–430. https://doi.org/10.18280/ts.390203
9. Bhattacharyya D, Doppala BP, Thirupathi Rao N (2020) Prediction and forecasting of persistent
kidney problems using machine learning algorithms. Int J Current Res Rev 12(20):134–139.
https://doi.org/10.31782/IJCRR.2020.122031
10. Um TT et al (2017) Data augmentation of wearable sensor data for Parkinson’s disease moni-
toring using convolutional neural networks. In: Proceedings of the 19th ACM International
conference on multimodal interaction, pp 216–220
11. Zhai J, Barreto A (2006) Stress recognition using non-invasive technology. In: Proceedings of
the 19th International Florida artificial intelligence research society conference FLAIRS, pp
395–400
Classification of Healthy and Diseased
Lungs by Pneumonia Using X-Rays
and Gene Sequencing With Deep
Learning Approaches
Debdatta Bhattacharya , K. V. Satyanarayana, N. Thirupathi Rao ,
and Eali Stephen Neal Joshua
Abstract This research work aims to predict lung disease from chest X-rays using
deep learning techniques. Lung disease is a term that refers to the improper
functioning of the lungs. There are many diseases which occur due to the abnormal
functioning of the lungs, including tuberculosis, pneumonia, lung cancer, and
asthma. The infection can be bacterial, viral, or fungal; it causes inflammation of
the trachea and respiratory failure. If found early, it can be cured, or else it can
even lead to death. This project classifies normal and abnormal X-rays with a
percentage of accuracy so that we can give the treatment to the patient accordingly
by examining the X-ray. The algorithms used are the convolutional neural network
(CNN) and the Inception Neural Network (INN), built with TensorFlow, Google's
open-source framework. The project is helpful for finding lung disease using chest
X-rays.
Keywords Convolutional neural networks · Inception v3 model · Inception neural network · Trachea · Pneumonia · Bronchitis
1 Introduction
When the lungs are unable to perform as effectively as they should because of a
disease or condition, this is referred to as a sickness of the lungs. Breathing problems
are most often brought on by some kind of lung disease. There are more than 32
different forms of lung issues, some of which include asthma, bronchitis, chronic
D. Bhattacharya (B)
Department of Computer Science and Engineering, Koneru Laksmaiah Education Foundation,
Vaddeswaram, Guntur, Andhra Pradesh 522302, India
e-mail: debdatta122001@gmail.com
K. V. Satyanarayana
Department of Computer Science and Engineering, Raghu Engineering College, Visakhapatnam,
Andhra Pradesh, India
N. T. Rao · E. S. N. Joshua
Department of Computer Science and Engineering, Vignan’s Institute of Information Technology,
Visakhapatnam, Andhra Pradesh 530016, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_20
illness, influenza, pneumonia, and others. Lung diseases are the third most common
cause of death worldwide. In India, lung illness is the main cause of mortality in
children under the age of one; because of that, at least one person loses their life
every single day. Patients who have this problem identified early on have a better
chance of avoiding its long-term effects. We used the Inception v3 model to evaluate
the state of the lungs and determine whether they are healthy. The convolutional
neural network (CNN), and in particular the Inception Neural Network (INN), is the
method often used here to categorize pictures into distinct groups.
When classifying photographs, deep learning algorithms often turn to convolutional neural networks. Filters are applied at each layer to extract that layer's distinctive features. A convolutional neural network has three layers: the convolutional layer, the pooling layer, and the fully connected layer, with the convolutional layer coming first. These layers contain the filters specific to the classification task. As part of this experiment, the researchers compare pictures obtained from an infected lung with those obtained from an uninfected lung. Comparisons can be made based on the color of the lungs while sorting. Several viruses and bacteria have been linked to the development of lung illness, and healthy people have lungs of a different color from those with respiratory illnesses such as pneumonia, bronchitis, or coronavirus. Setting up the filter with the training picture is required in order to find the pulmonary walls in the image. Two distinct kinds of pooling take place in the pooling layer: maximal and average. Average pooling computes the mean of the values in each filtered region, while max pooling selects the largest value in each filtered region.
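The two pooling operations can be illustrated directly. The following is a minimal NumPy sketch, assuming a toy 4 × 4 feature map and a 2 × 2 window with stride 2; the values are illustrative and are not taken from the chapter.

```python
import numpy as np

# Toy 4x4 feature map; values are illustrative only.
feature_map = np.array([
    [1, 3, 2, 0],
    [4, 6, 1, 1],
    [0, 2, 5, 7],
    [1, 1, 3, 2],
], dtype=float)

def pool2x2(x, op):
    """Apply `op` (np.max or np.mean) over non-overlapping 2x2 windows."""
    h, w = x.shape
    out = np.empty((h // 2, w // 2))
    for i in range(0, h, 2):
        for j in range(0, w, 2):
            out[i // 2, j // 2] = op(x[i:i + 2, j:j + 2])
    return out

print(pool2x2(feature_map, np.max))   # max pooling: strongest response per window
print(pool2x2(feature_map, np.mean))  # average pooling: mean response per window
```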
When an infection is diagnosed and treated at an early enough stage, there is a chance that lung failure may be avoided. When used together, the convolutional neural network and the Inception Neural Network make it much easier to determine whether or not a patient's lungs have been contaminated by germs or viruses. To stop the model from producing inaccurate predictions, it has to be trained to the greatest possible level of precision. The code for this project was developed with the assistance of already existing systems, numerous pieces of research on lung illness, the convolutional neural network, and the Inception model. This project, which will ultimately broaden its scope to include the identification of damaged organs, makes use of scanned images as its dataset. Using scan images like these as input, it is possible to determine whether an organ is healthy or ill.
2 Literature Survey
In 1962, Hu introduced moment invariants, influenced by the algebraic invariant theory of David Hilbert [1]. Moment invariants were first employed to solve pattern recognition problems; his seven two-dimensional invariants were compared with algebraic invariants. PDSs are low-level encoding schemes: these systems take picture data and produce an image with noise reduction and sharpening. The original image is utilized to make an intermediate copy, which is used to recognize and isolate items. Uppal Uri and colleagues [2] developed an extended territory-locating algorithm. Using 15 statistical and fractal texture characteristics, this approach divides small regions into six categories. Radiologists must first identify the two lungs in an X-ray picture before looking for issues. CAD systems concentrate on one body area, such as the thorax, breast, or colon, and employ X-rays, CT scans, PET scans, and MRI.
Gabor filtering was proposed by Manish Kakar and others [3] to recover picture texture information. The segmentation's delineation accuracy was above 90%. Combining shape and position-based data with cortex-like features increased the sensitivity of a simple SVM classifier to 89.48%; this was achieved through automatic segmentation. Summers noted that this procedure takes a long time and that there are too many images to process, which matters in nations with large populations but few physicians. Kim, Ko, and Jung addressed DFR using neural networks and time intervals. A CT scan is usually more informative and accurate than an X-ray, yet doctors rely on chest X-rays to identify lung cancer and TB early. F.-Y. Zou [4] suggested two methods: the first modifies wavelets to minimize image noise, while the second evaluates edge detection operators such as Differential, LoG, Canny, and binary morphology. Both boost picture quality. Based on simulation findings, positive and negative characteristics of several edge detection operators were studied; binary morphology may enhance edge appearance. The borders-closed approach was proposed as a last resort to acquire a full image profile. Abby A. Goodrum [5], in "Image information retrieval: a survey of current research", covers text-based retrieval, content-based retrieval, and image retrieval user experiences; this comprehensive survey spans several domains. The paper "Feature Selection: Evaluation, Application, and Small Sample Performance" [6] by Pudil et al. shows that Jain and Zongker's solution outperforms other approaches. SAR satellite images and four texturing models are utilized to find a set of features that may be used to define land uses. When characteristics from several texture models and other factors are added in categorization, accuracy improves.
3 Materials and Methods
3.1 Convolutional Neural Network (CNN)
A particular type of feed-forward artificial neural network influenced by the visual cortex is the Convolutional Neural Network. The visual cortex is a small region of the brain that is sensitive to particular areas of the visual field, helping us to identify the things we see. In the Convolutional Neural Network, a neuron in a layer is connected only to a small region of the layer before it, instead of to all the neurons as in fully connected networks [7]. A CNN typically comprises the following layers: the convolution layer, the pooling layer, the ReLU [8] layer, and lastly the fully connected layer.
The convolutional neural network consists of three layers, each of which has filters specific to it. The pooling layer gets its input from the previous convolution layer. There are two types of pooling layers, namely max pooling [9] and average pooling. The max pooling layer selects the maximum element in the area covered by the filter, whereas the average pooling layer takes the average of the elements in the area covered by the filter. The output from the pooling layer is passed as the input to the next layer, the fully connected layer. After going through all the filters in each layer, the input image from the dataset gets classified based on the specification of the filters. Once the machine is trained on the input dataset, newly incoming images can be classified easily by the model with high accuracy (Fig. 1).
Fig. 1 Showing CNN architecture with improvised max-pool layers
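The layer stack described above can be written down compactly in Keras. The following is a minimal sketch, assuming an illustrative 150 × 150 grayscale input and filter counts of our own choosing; the chapter does not report these hyperparameters.

```python
import tensorflow as tf

# Minimal CNN following the three-stage pattern described in the text:
# convolution -> pooling -> fully connected, ending in a two-class
# (normal vs. abnormal X-ray) softmax. Sizes are illustrative assumptions.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(150, 150, 1)),
    tf.keras.layers.Conv2D(32, (3, 3), activation="relu"),
    tf.keras.layers.MaxPooling2D((2, 2)),
    tf.keras.layers.Conv2D(64, (3, 3), activation="relu"),
    tf.keras.layers.MaxPooling2D((2, 2)),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(2, activation="softmax"),  # normal vs. abnormal
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.summary()
```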
3.2 Inception V3 Model
Inception models contain two important parts: a feature-extraction part built from convolutional networks and a classification part built from fully connected networks. In the first part, the typical features of the images are extracted from the input, while in the second, images are classified based on those features. Inception v3 models are pre-trained deep learning models that achieve state-of-the-art precision in identifying general objects. The model contains many layers and sub-networks, and at each layer, features are extracted and stored for classification.
Fig. 2 Showing the model of the proposed inception V3 model
4 Proposed Model
The whole algorithm accepts a Keras classifier model [10], which can be loaded with pre-trained ImageNet weights if desired. For transfer learning use cases, the Keras guide to transfer learning and fine-tuning is a useful reference. Note that each Keras application needs a different type of input preprocessing: for Inception V3, call tf.keras.applications.inception_v3.preprocess_input before forwarding the inputs to the model. Input pixels will be scaled to the range [−1, 1] by preprocess_input (Fig. 2).
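A minimal sketch of this loading and preprocessing step is shown below, using the stock tf.keras.applications API; the random array merely stands in for a chest image, and the 299 × 299 input size is InceptionV3's default rather than a value reported in the chapter.

```python
import numpy as np
from tensorflow.keras.applications.inception_v3 import InceptionV3, preprocess_input

# Load InceptionV3 with pre-trained ImageNet weights, as the text describes.
model = InceptionV3(weights="imagenet")

# Stand-in for a batch of one 299x299 RGB chest image.
image = np.random.randint(0, 256, size=(1, 299, 299, 3)).astype("float32")

scaled = preprocess_input(image)   # pixels rescaled to the range [-1, 1]
predictions = model.predict(scaled)
print(predictions.shape)           # (1, 1000) ImageNet class scores
```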
5 Results and Discussions
The initial photos of reactive lymphoid hyperplasia, NHL, SCC, and adenocarcinoma had classification accuracies of 88.46%, 80.77%, 89.29%, and 100%, respectively. On the test dataset, the overall accuracy was 99.62%. Cohen's kappa [8] was used to measure the agreement between the cytopathologic diagnoses and the DCNN, which was 0.862 ± 0.077. Three split images of reactive lymphoid hyperplasia and three split images of SCC were incorrectly labeled as NHL.
We dug deeper into the misclassified photos to figure out why they failed. The split images of reactive lymphoid hyperplasia that were misdiagnosed as NHL are shown in Fig. 3, as are the split photos of NHL [11] that were misdiagnosed as reactive lymphoid hyperplasia and the split photos of NHL that were misdiagnosed as SCC and adenocarcinoma. The fragmented images of SCC that were misdiagnosed as NHL are shown in Fig. 2. The cytopathologists' analyses of the images are given in the figure legends.
Fig. 3 Showing the accuracy of the Inception V3 Model with comparison of CNN architecture
6 Proof of Concept
6.1 POC Architecture
In the proposed model, more than 4000 images were provided for training purposes. The model uses convolutional neural networks to extract many features from each image. For each feature, a matrix of vector values is assigned. While passing through the model, the matrix values shrink until a series of values remains. The model stores these array values for all tagged images. Given an unknown image, we apply the same method to extract the features and then compare the extracted features with the stored features of the training images using a fully connected network. Results are displayed based on accuracy. This model has achieved the highest accuracy of 92.57% so far. The user's input image is entered into the model, and after classification, the image is displayed in a separate window with the corresponding results.
6.2 POC Design
To make the model more user-friendly, a graphical user interface has been developed for feeding images into the model. The model can run on multiple images as well as a single image, allowing users to identify disease more clearly using many images of a particular patient taken from different angles.
This GUI was developed using Python packages such as Tkinter and Pillow. Images containing classification results are displayed in a separate window, improving usability.
6.3 Result
With the help of the proposed model, which has a prediction accuracy of 92.57%, users can easily classify lung images as normal or abnormal, helping patients receive treatment in advance. This helps doctors assess the severity of a patient's condition and treat it accordingly.
7 Conclusion and Future Work
The approach presented here makes it easier to identify normal or abnormal lung scan findings, with a 92.57% accuracy rate. Patients can decide whether they require therapy before it is too late and can be treated reasonably, given their illness. Through training on their dataset, the authors built a model that could distinguish lung disease and classify other normal or aberrant images of affected organs; the model is not limited to identifying lung disease, since any bodily part might be affected by sickness. This method improves patients' quality of life and helps people pick and implement solutions for solvable problems.
References
1. Xie Y, Xia Y, Zhang J, Feng DD, Fulham M, Cai W (2017) Transferable multi-model ensemble for benign-malignant lung nodule classification on chest CT. In: Medical image computing and computer-assisted intervention—MICCAI 2017, pp 656–664
2. Cao H, Liu H, Song E, Ma G, Xu X, Jin R, Liu T, Hung C (2019) Multi-branch ensemble
learning architecture based on 3D CNN for false positive reduction in lung nodule detection.
IEEE Access 7:67380–67391
3. Bhattacharyya D, Dinesh Reddy B, Kumari NMJ, Rao NT (2021) Comprehensive analysis on
comparison of machine learning and deep learning applications on cardiac arrest. J Med Pharm
Allied Sci 10(4):3125–3131. https://doi.org/10.22270/jmpas.V10I4.1395
4. Armato SG 3rd et al (2011) The lung image database consortium (LIDC) and image database
resource initiative (IDRI): a completed reference database of lung nodules on CT scans. Med
Phys 38(2):915–931. https://doi.org/10.1118/1.3528204
5. Han F, Wang H, Zhang G, Han H, Song B, Li L, Moore W, Lu H, Zhao H, Liang Z (2014)
Texture feature analysis for computer-aided diagnosis on pulmonary nodules. J Digit Imaging
28:99–115
6. Wei G, Ma H, Qian W, Han F, Jiang H, Qi S, Qiu M (2018) Lung nodule classification using
local kernel regression models with out-of-sample extension. Biomed Signal Process Control
40:1–9
7. Shen W, Zhou M, Yang F, Yu D, Dong D, Yang C, Zang Y, Tian J (2016) Multi-crop convo-
lutional neural networks for lung nodule malignancy suspiciousness classification. Pattern
Recognit 61
8. Eali SNJ, Bhattacharyya D, Nakka TR, Hong S (2022) A novel approach in bio-medical image
segmentation for analyzing brain cancer images with U-NET semantic segmentation and TPLD
models using SVM. Traitement Signal 39(2):419–430. https://doi.org/10.18280/ts.390203
9. Bhattacharyya D, Doppala BP, Thirupathi Rao N (2020) Prediction and forecasting of persistent
kidney problems using machine learning algorithms. Int J Current Res Rev 12(20):134–139.
https://doi.org/10.31782/IJCRR.2020.122031
10. Togacar M, Ergen B, Cömert Z (2019) Detection of lung cancer on chest CT images using
minimum redundancy maximum relevance feature selection method with convolutional neural
networks. Biocybern Biomed Eng 40
11. Thakur SK, Singh DP, Choudhary J (2022) Lung cancer: detection and classification of malig-
nancies. In: Dubey HM, Pandit M, Srivastava L, Panigrahi BK (eds) Artificial intelligence and
sustainable computing. algorithms for intelligent systems. Springer, Singapore. https://doi.org/
10.1007/978-981-16-1220-6_38
Breast Cancer Classification Using
Improved Fuzzy C-Means Algorithm
N. Thirupathi Rao , K. V. Satyanarayana, M. Satyanarayana,
Eali Stephen Neal Joshua , and Debnath Bhattacharyya
Abstract Abnormal growth in the breast tissue leads to anomalous cell development in the breast. To interpret this in a mammogram precisely, the quality of the images ought to be at its best. The proposed research work is carried out to examine different screening strategies for recognizing the different phases of breast malignancy. In India, a woman is diagnosed with this disease every 4 min, and a woman dies of it every 13 min. The disease is more prominent among people living in rural areas than among people in urban areas. Therefore, it is very important to find and treat this disease as early as possible. The breast tumor region, perimeter, and breadth are assessed from mammogram picture databases. The Bit Error Rate (BER), Peak Signal-to-Noise Ratio (PSNR), and Mean Squared Error (MSE) values are determined for both abnormal and normal images. These analyses were used to confirm the presence or absence of the disease and to support the evaluation process for finding the disease. This quality assessment is used to establish the ground truth for a specific diagnosis, that is, a specific type of chromatin in a carcinogenic nucleus that may indicate an irregular protein sequence.
Keywords PSNR · MSE · Malignancy · Cancer · Classification · Fuzzy means algorithm
N. T. Rao (B) · E. S. N. Joshua
Department of Computer Science and Engineering, Vignan’s Institute of Information Technology,
Visakhapatnam, Andhra Pradesh 530016, India
e-mail: nakkathiru@gmail.com
K. V. Satyanarayana · M. Satyanarayana
Department of Computer Science and Engineering, Raghu Engineering College, Visakhapatnam,
Andhra Pradesh, India
D. Bhattacharyya
Department of Computer Science and Engineering, Koneru Laksmaiah Education Foundation,
Vaddeswaram, Guntur, Andhra Pradesh 522302, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_21
1 Introduction
This article explains the importance of breast cancer awareness [1]. Breast cancer accounts for 28–35% of all female malignancies across all conurbations in India. It is very important to find breast cancer at an earlier stage. Worldwide, the U.S. is among the countries most affected by this disease. In India, a woman is diagnosed with this disease every 4 min [2], and a woman dies of it every 13 min. The disease is more common among people living in rural areas [3] than among people in urban areas. Therefore, it is very important to find and treat this disease as early as possible.
According to the affected areas and the severity of the problem [4], this cancer is divided into different stages, and within each stage it is divided into different types; early-stage cancer is more treatable than later stages.
Motivation and Previous Work.
Athertya et al. (2016) developed an automatic delineation of contours from CT images using fuzzy corners [5]. In this method, the automatic initialization of contours was demonstrated using the active contour method. The fuzzy corner approach gave an accuracy of 80% with a high Dice coefficient and a low Hausdorff distance. This algorithm is suitable for noisy images as well, but it can be a daunting task for lax tissue images, where finding the corners of the image is complex. Elisee Ilunga Mbuyamba et al. (2016) proposed an alternative active contour model (ACM) driven by a Multi-population Cuckoo Search algorithm [6]. This strategy assists the convergence of control points toward the global [7] minimum of the energy function, unlike ACM, which is often trapped in local minima. The algorithm was tested and implemented on MRI images. Three metrics, the Jaccard index, Dice coefficient, and Hausdorff distance [8], were used to assess the results. This method requires fewer iterations and is robust and more effective, but it takes more computational time to compute the iterations. Agus Pratondo et al. (2016) delivered improved robust Edge Stop Functions (ESFs) for edge-based active contour models [2]. Conventional edge stop functions use gradient information, which fails to stop contour evolution when the image has poor boundaries. In this method, new ESFs were used that combine gradient information with probability scores to classify the mass. This method was evaluated using two quantitative measures, namely the Jaccard index and the Dice coefficient. The method converges faster and gives global contours, but it is complex. Radha et al. (2016) proposed an image enhancement technique for breast cancer detection [9]. Mean filters, the median filter, the Wiener filter, and a linear filter are used for pre-processing; among these, the median filter provides the best results. Image segmentation is performed through a thresholding technique and the K-means algorithm, and the tumor edges are detected using the Canny edge detection technique. This algorithm results in higher accuracy; its limitation is the difficulty of finding blurred image edges.
Fig. 1 Diagrammatic flow of the proposed method
2 Proposed Work
The proposed method comprises four steps. In the first step, pre-processing is done, where unwanted parts such as labels are removed. In the second step, optimization is done, where the image is optimized for the further processing methods. In the third step, segmentation is done, where the exact affected parts are obtained. Segmentation is followed by the fourth step, feature extraction, where special features are extracted and made ready for classification.
2.1 Pre-processing
The main purpose of pre-processing is to improve the image quality in an effective manner. The proposed method consists of a few pre-processing steps: in the first step, the background is removed; in the second step, the pectoral muscle is removed based on the image orientation; and in the third step, the image is enhanced so that the quality improves without introducing artifacts. Figure 1 shows the flow diagram of the pre-processing method.
2.2 Labels and Other Artifacts Removal from the Background
To find the tissue boundary of the breast, we do the following. First, we convert the image from unsigned integer to double precision, and then we compute the image energy, which is equal to the second power of the decimal image. After conversion, according to a threshold value, we transform the energy image back into a binary image. The background will not cover the breast area; it mostly occurs to the right and left of the image. As the background regions are dark, their gray level values are always close to zero and differ little from each other. Therefore, a new function is defined by fixing an intensity value: the parameters are fixed at the specified value, the range within that fixed value is given the maximum intensity value, and the ranges above and below the fixed specified value are marked accordingly.
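As a rough illustration of this step, the following NumPy sketch converts an 8-bit mammogram to double precision, squares it to obtain the image energy, and thresholds the result into a binary mask; the threshold value of 0.1 is an illustrative assumption, not the authors' figure.

```python
import numpy as np

def breast_mask(gray_uint8, threshold=0.1):
    img = gray_uint8.astype(np.float64) / 255.0   # unsigned int -> double precision
    energy = img ** 2                             # second power of the image
    return energy > threshold                     # True = candidate breast tissue

mammogram = np.random.randint(0, 256, (1024, 1024), dtype=np.uint8)  # stand-in image
mask = breast_mask(mammogram)
print(mask.mean())   # fraction of pixels kept as tissue
```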
2.3 Removing of Pectoral Muscle
The pectoral muscle is a muscle that lies behind the border of the breast. It is an unwanted portion that commonly appears in mammogram images; therefore, removing this region is helpful for the subsequent segmentation process. Various methods are available to remove it. In this thesis, we remove the unwanted region by finding the correct border of the breast; once the correct border is found, it is easy to remove the remaining unwanted regions.
2.4 Enhancement of Image
Enhancement is the final pre-processing step. During enhancement, the quality of the image is improved, which is important for the processes that follow. Spatial domain and frequency domain are the two basic classes of image enhancement techniques; in our proposed method, median filtering is used to enhance image quality. Median filtering is done by computing the median of the pixel values within a window. The algorithm for the median filter is shown below.
Step 1: The pectoral muscle removed image is obtained
Step 2: If the obtained pixel is noisy, it should undergo a further process
Step 3: Replace all the noisy pixels using the median value
Step 4: Shift the window
Step 5: Repeat step 3 for all the pixel values
Step 6: Obtain the enhanced image.
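A minimal sketch of these steps using SciPy is given below; the 3 × 3 window size is an assumption, since the chapter does not state the window size it used.

```python
import numpy as np
from scipy.ndimage import median_filter

# Sliding-window median filtering: each (noisy) pixel is replaced by the
# median of its window, as described in the steps above.
noisy = np.random.randint(0, 256, (256, 256), dtype=np.uint8)  # stand-in image
enhanced = median_filter(noisy, size=3)
```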
2.5 Measurement of PSNR Value and MSE Value
The performance of this median filter is calculated using the peak signal-to-noise
ratio (PSNR) value and MSE values.
$(x + a)^n = \sum_{k=0}^{n} \binom{n}{k} x^k a^{n-k}$ (1)
Peak signal-to-noise ratio (PSNR) is the ratio between the maximum possible value (power) of a signal and the power of the noise distortion that affects the quality of its representation. Since many signals have a very wide dynamic range (the ratio between the largest and smallest values of a variable quantity), the PSNR is usually expressed on a logarithmic decibel scale. Here, the obtained PSNR value is 35.28, which shows that the final enhanced image is better in quality, and the MSE value is 2.9861, which shows that the result has low error and the obtained image quality is very good. These results suggest that the proposed technique has convincingly enhanced the quality and contrast of the image. Judging the visual quality of a digital image is subjective: whether a system provides a high-quality picture differs from person to person. For this reason, it is necessary to establish quantitative/empirical measures to compare the effects of image enhancement methods on image quality. Using the same test images, we can systematically compare different image enhancement methods and identify whether a particular method produces better results. The metric under investigation is the peak signal-to-noise ratio. If we can show that an algorithm can restore a degraded version of a known image closer to the original, then we can more confidently conclude that it is the better algorithm.
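The two metrics are straightforward to compute; a minimal sketch for 8-bit images follows, using the standard definitions (the chapter reports PSNR = 35.28 and MSE = 2.9861 for its enhanced mammograms).

```python
import numpy as np

def mse(original, enhanced):
    diff = original.astype(np.float64) - enhanced.astype(np.float64)
    return np.mean(diff ** 2)                # mean squared error

def psnr(original, enhanced, peak=255.0):
    err = mse(original, enhanced)
    if err == 0:
        return float("inf")                  # identical images
    return 10.0 * np.log10(peak ** 2 / err)  # expressed in decibels

a = np.random.randint(0, 256, (64, 64), dtype=np.uint8)  # stand-in original
b = a.copy()
b[0, 0] ^= 4                                 # tiny perturbation
print(mse(a, b), psnr(a, b))
```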
3 Optimization Segmentation
Medical images are usually not crystal clear: they contain more noise, and the picture quality is low compared with other digital images. Segmenting such images directly may therefore lead to poor segmentation, making the identification of cancerous cells complex. To reduce these complexities, the obtained medical image is first optimized.
3.1 Optimization Using Independent Search Krill Herd
Technique
In this manuscript, a swarm intelligence algorithm named krill herd (KH) is used to solve the optimization tasks. The KH algorithm simulates the behavior of krill individuals: the krill try to maintain a high herd density, moving under their mutual effects. For a krill individual, this movement can be defined as:
$f(x) = a_0 + \sum_{n=1}^{\infty} \left( a_n \cos \frac{n \pi x}{L} + b_n \sin \frac{n \pi x}{L} \right)$ (2)
Here, $N^{max}$ denotes the maximum induced speed, $\omega_n$ denotes the inertia weight of the induced motion in the range [0, 1], $N^{old}$ is the previous induced motion contributed by the fellow krill, and $\alpha^{target}$ is the target direction effect provided by the best krill individual. The measured value of the maximum induced speed is taken as 0.02 (m/s). The effect of the neighbors can be considered as an attractive/repulsive tendency among the individuals for a neighborhood search. In this study, the effect of the neighbors on a krill individual's movement is determined as follows:
$\sin \alpha \pm \sin \beta = 2 \sin \tfrac{1}{2}(\alpha \pm \beta) \cos \tfrac{1}{2}(\alpha \mp \beta)$ (3)
The neighbor's vector might be attractive or repulsive, since the normalized value can be negative or positive. The sensing distance for every krill individual can be determined using various techniques; here, it is determined using the following equation at every cycle:
$\cos \alpha + \cos \beta = 2 \cos \tfrac{1}{2}(\alpha + \beta) \cos \tfrac{1}{2}(\alpha - \beta)$ (4)
where $d_{s,i}$ denotes the sensing distance for the ith krill individual and N denotes the total number of krill individuals; the value 5 given in the denominator of the equation was found empirically. Using the above condition, if the separation between two krill individuals is less than the defined distance, they are neighbors. The guiding vector of every krill member is that of the lowest-fitness member, and the effect of the krill member with the best fitness on the ith individual krill is computed using the above equation. Here, $C^{best}$ is the effect of the krill with the best fitness on the ith krill member. $C^{best}$ is characterized as a target that drives the solution toward the global optimum, and its effect should be stronger than that of other krill individuals, such as neighbors.
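A hedged sketch of the induced-motion update is given below, following the standard KH formulation rather than code from this chapter: the new induced motion combines a direction term α (neighbor effects plus the best-krill target effect) with inertia from the previous motion, using the maximum induced speed of 0.02 quoted above.

```python
import numpy as np

N_MAX = 0.02   # maximum induced speed quoted in the text

def induced_motion(alpha, n_old, inertia=0.5):
    # alpha:   combined direction from the neighbors and the best individual
    # n_old:   previous induced motion of this krill
    # inertia: weight in [0, 1], as described above (0.5 is an assumption)
    return N_MAX * alpha + inertia * n_old

alpha = np.array([0.6, -0.8])                 # example direction in a 2-D search space
print(induced_motion(alpha, np.zeros(2)))     # first-iteration motion
```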
In Fig. 2, the first graph shows the fitness curve obtained using the independent free search krill herd optimization (IFSKHO) technique, and the second graph shows the fitness curve obtained using the PSO technique. From the graphs, it is clear that the IFSKHO technique produces a better fitness value than the PSO technique.
Fig. 2 Fitness curves: a fitness curve output of the proposed method and b fitness curve output of the PSO technique
4 Results and Discussions
Classification is a process used in medical image processing to distinguish benign and malignant tumor cells. Breast cancer is classified according to different criteria, each serving a different purpose. The major categories are histopathological type, tumor grade, tumor status, and the expression of proteins and genes. In this thesis, the Gray Level Co-occurrence Matrix is used to distinguish cancerous and non-cancerous cells. The criteria are the size of the tumor, its mobility, its spread to the lymph nodes, and its spread to other parts of the body. Basically, the work of a classifier is to separate good tissue from bad tissue; in other words, the classifier has to distinguish cancer cells from non-cancerous cells. The classification of cancer cells can be done by identifying the subtype counts in the feature-extracted cells. Estrogen Receptor (ER), Human Epidermal Growth Factor Receptor 2 (HER2), basal-like, luminal-A, and luminal-B are the subtypes most commonly used as predictive factors for distinguishing abnormal cells from normal cells. Our analyses were conducted in two parts to assess these signatures (from now on, we refer to all prognostic signatures and biological pathways simply as signatures, unless explicitly distinguished) for their predictive capacity. Breast cancers are classified into known molecular subtypes (basal-like, HER2-enriched, luminal-A, and luminal-B) and by ER status (ER+ and ER−). The figure shows the occurrence of the various subtypes in the different stages of cancer and the methods currently used to treat the cancer cells. AR stands for Androgen Receptor, BCL is B-cell Lymphoma, CK is Cytokeratin, EGFR is Epidermal Growth Factor Receptor, ER is Estrogen Receptor, ERBB2 is the Erythroblastic Leukemia Oncogene homolog, PARP is poly (ADP-ribose) polymerase, and PR is Progesterone Receptor. Estimates of mammography sensitivity range from 75 to 90%, with specificity from 90 to 95%. The positive predictive value of mammography for breast cancer ranges from 20% in women under age 50 to 60–80% in women aged 50–69, and the negative predictive value can range from 90 to 95% in women over 40 years of age (Table 1).
Table 1 The values of the reliability ratio for the diagnostic procedure

Parameters                         Mammography
Sensitivity (%)                    52.4
Specificity (%)                    66.7
Positive predictive value (%)      40.7
Negative predictive value (%)      76.2
5 Conclusion and Future Work
We developed a new methodology for automatic mass detection to aid in the manual identification of masses in mammography images. The suggested approach begins by identifying the tumor inside the specified region of interest using a fuzzy c-means technique and then verifies the image features (i.e., texture) produced from the FCM input data using GLCM texture features to help the segmentation process. As can be observed, the suggested methodology's results closely matched those from the Mini-MIAS database, demonstrating that the suggested technique is capable of properly and automatically extracting masses from the ROI.
References
1. International Agency for Research on Cancer (2018) Latest global cancer data: cancer
burden rises to 18.1 million new cases and 9.6 million cancer deaths in 2018 [Press
release]. 12 September. Available at: https://www.iarc.fr/wp-content/uploads/2018/09/pr263_
E.pdf. Accessed 21 Feb 2022
2. Parvathavarthini S, Visalakshi NK, Shanthi S, Mohan JM (2018) Crow search optimization
based fuzzy c-means clustering for optimal centroid initialization. TAGA J Graph Technol
14:3034–3045
3. Ravnik J, Jovanovac A, Trupej N, Vistica M (2021) A sigmoid regression and artificial neural
network models for day-ahead natural gas usage forecasting. Cleaner Responsible Consumption
3. Article 100040
4. Bhattacharyya D, Doppala BP, Thirupathi Rao N (2020) Prediction and forecasting of persistent
kidney problems using machine learning algorithms. Int J Current Res Rev 12(20):134–139.
https://doi.org/10.31782/IJCRR.2020.122031
5. Sable SD, Deshmukh A (2015) Implementation and analysis of k-means and fuzzy c-means
clustering techniques. Int J Ind Electron Electr Eng 3:173–176
6. Wang YF, Wang DH, Chai TY (2013) Active control of friction self-excited vibration using
neuro-fuzzy and data mining techniques. Expert Syst Appl 40(4):975–983
7. Asish Vardhan K, Thirupathi Rao N, Naga Mallik Raj S, Sudeepthi G, Divya, Bhattacharyya
D, Kim T (2019) Health advisory system using IoT technology. Int J Recent Technol Eng
7(6):183–187. Retrieved from www.scopus.com
8. Eali SNJ, Bhattacharyya D, Nakka TR, Hong S-P (2022) A novel approach in bio-medical image
segmentation for analyzing brain cancer images with U-NET semantic segmentation and TPLD
models using SVM. Traitement Signal 39(2):419–430. https://doi.org/10.18280/ts.390203
9. Chidambaranathan S (2016) Breast cancer diagnosis based on feature extraction by hybrid of
k-means and extreme learning machine algorithms. ARPN J Eng Appl Sci 11:4581–4586
Repercussions of Incorporating Filters
in CNN Model to Boost the Diagnostic
Ability of SARS-CoV-2 Virus Using
Chest Computed Tomography Scans
Dhiren Dommeti , Siva Rama Krishna Nallapati , P. V. V. S. Srinivas ,
and Venkata Naresh Mandhala
Abstract A pandemic called COVID-19 has threatened the world with its high morbidity and transmission rate. It is vital to accurately detect and determine traces of the infection, as it has caused around 62 lakh (6.2 million) deaths. Numerous researchers have set out to propose virus detection solutions using chest computed tomography scans based on deep learning, yet an accurate comparison of these techniques is not available. Within this document, a convolutional neural network model is suggested that enhances the accuracy of detecting the virus from CT scans by incorporating distinct filters into the model. The proposed model attained an accuracy of 0.86 unfiltered; incorporating the Gabor filter helped achieve an accuracy of 0.93, and an accuracy of 0.85 was attained using the bilateral, non-local means, and hybrid filtering techniques.
Keywords COVID-19 · CNN · Filter · CT scans · Diagnosis
1 Introduction
2019-nCoV is a contagious illness caused by the SARS-CoV-2 virus. The disease caused a massive public health crisis and was hence declared a global pandemic by the World Health Organization [1]. The majority of those infected experience mild-to-severe respiratory illness. By 20 May 2022, 52.4 crore (524 million) cases had been confirmed, with more than 62.7 lakh (6.27 million) deaths [2]. Some people become seriously ill with severe respiratory symptoms, which may lead to acute respiratory distress syndrome and, in turn, death. Aged patients with underlying diseases such as cardiovascular disease, diabetes, chronic pulmonary disease, or cancer are prone to severe health issues; hence, the mortality rate is high in these patients. The virus spreads from the mouth or nose of an infected individual through their cough, sneeze, or
D. Dommeti (B) · S. R. K. Nallapati · P. V. V. S. Srinivas · V. N. Mandhala
Department of Computer Science Engineering, Koneru Lakshmaiah Education Foundation,
Vaddeswaram, Andhra Pradesh, India
e-mail: dhiren2910dommeti@gmail.com
P. V. V. S. Srinivas
e-mail: cnu.pvvs@kluniversity.in
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_22
breath, in particles ranging from respiratory droplets to smaller aerosols. Such patients should stay at home and self-isolate until they recover, to avoid spreading the disease. CT scans, also known as chest computed tomography scans, are an important approach to diagnosing the SARS-CoV-2 virus and a powerful tool for rapid screening of suspected cases alongside other laboratory tests. RT-PCR has become the standardized detection approach for SARS-CoV-2, as it is performed specifically for the virus. The limitations of RT-PCR include a low sensitivity rate in the range of 60–70%, diverse test methods, high expenses, long turnaround times, and limited test capacity in many countries [3]. Modern research and experimentation have shown that clinical pulmonary images can be used to investigate COVID-19. The imaging modalities widely used for diagnosis are the chest X-ray (CXR), also commonly known as a chest film, and the chest computed tomography scan. Due to their wide availability and usage, the diagnostic performance of COVID-19 detection has improved in hospitals [4]. An investigation conducted on a set of 1014 COVID-19 patients [5] states that 601/1014 patients tested positive for the virus using RT-PCR, whereas 888/1014 patients were detected positive using computed tomography scans (CT scans). Statistically, 59% of patients were diagnosed positive using RT-PCR, while 88% of patients were positive according to CT scans. Hence, CT scans are strongly recommended for patients who have a negative PCR test but show symptoms of the virus [6].
2 Related Work
Evidently, there are many proposals for image analysis techniques for SARS-CoV-2 prognosis using ML-based algorithms, intended to assist medical practitioners in diagnosing and treating the SARS-CoV-2 virus efficiently using radiographic images of the chest. A model called DeTraC was proposed by [7]: a CNN-based model that uses the class decomposition technique to observe class boundaries and detect irregularities in the X-ray image. A recognition accuracy of 95.12% was observed in the proposed system. To estimate severity from chest computed tomography images, Tang et al. [8] proposed an ML-based technique with a classification accuracy of 87.5%. A Self-Trans approach combining self-supervised learning (learning from unlabelled data) with transfer learning (reusing previously learnt representations) was proposed by He et al. [9]; the approach generates powerful and unbiased features that help achieve 86% identification accuracy. Ucar and Korkmaz [10] proposed using SqueezeNet for COVID-19 diagnosis. The SqueezeNet was pre-trained on the ImageNet dataset and fine-tuned on an augmented COVID-19 dataset, which incorporated three types of images: COVID-19, normal, and non-infected. To obtain the best hyperparameters, a Bayesian optimization technique was incorporated into the model; it is stochastic and attempts to minimize a scalar objective function in a bounded domain. The proposed Bayesian-SqueezeNet-based diagnosis model attained an accuracy of 98.26%. Narin et al. [11] proposed three 2D CNNs
for COVID-19 prognosis. Zhang et al. [12] introduced a deep anomaly detection model for fast and reliable screening. Ghoshal and Tucker [13] used a drop-weights-based Bayesian CNN on X-ray images to investigate uncertainty estimation, whereas [14] implemented both segmentation and detection using both radiographic images and computed tomography scanned images. Fang et al. [15] reviewed the travel records and symptoms of two patients, concluding that CT scans were more accurate for the detection of COVID-19 and that their sensitivity was much higher than that of reverse transcription-polymerase chain reaction. Bernheim et al. [16] investigated CT scans of 121 patients who tested positive for the COVID-19 virus. Their investigation states that the seriousness of the condition gradually increased as time went on from the initial appearance of the symptoms related to the disease. Li et al. [17] put forward COVNet, a deep learning model to investigate COVID-19 by extracting visual features that differentiate pneumonic from other non-pneumonic lung disorders in chest tomography scans; yet, COVNet was unable to categorize the seriousness of the disease. Wang et al. [18] developed a DL-based prediction model with an accuracy of 89.5%. Notably, the approach is finer than the Xu et al. [19] model, which obtained a maximum accuracy of 86.7%, and it saves diagnosis time. Xu's model is a CNN-based model that discriminates COVID-19 pneumonia from influenza-A. In [20], a UNet++ was trained to identify SARS-CoV-2 patients by gathering 46,096 CT image slices of COVID-19 patients and patients with other pneumonic lung disorders; the trained model accomplished diagnosis comparable to expert radiologists. A distinct model was proposed with a minimal number of layers and a CNN architecture based on weighted filters, which helps increase accuracy by prioritizing a set of features [22]. CNN models are used for representation learning but are constrained in feature optimization; a bi-stage feature selection approach was therefore proposed for choosing a minimal set of features from CNNs trained on CT scan images [23]. Likewise, many other network architectures have been considered for developing an AI-based detection system for the COVID-19 virus.
3 Proposed Work
Through the conducted research, it is intended to classify chest computed tomography scans with the proposed COVID-19 model without any filter and to state the observations that occur when four distinct filters are applied. The framework of the proposed convolutional neural network model consists of four convolutional blocks followed by one flatten layer and two dense layers. The first convolutional block consists of one Conv2D layer; the second consists of a Conv2D layer, a max pooling layer, and a batch normalization layer; the third consists of one Conv2D layer; and the fourth consists of a convolutional layer, a batch normalization layer, a max pooling layer, and a dropout layer, one of each. The first dense layer is paired with one dropout layer, and the second dense layer stands alone. All the convolutional layers use a kernel size of 3 × 3, the max pooling size is 2 × 2, and the activation functions are ReLU in the hidden layers and softmax in the final dense layer. The total number of parameters is 32,116,743, of which 32,116,103 are trainable and 640 are non-trainable. All the images are normalized and standardized using standardization and normalization techniques and then resized into a fixed dimension of 48 × 48 to maintain uniformity. The dataset contains three types of data: negative, positive, and uncertain images. Each category includes approximately 5000 scanned images, which are resized to 128 × 128 pixels in JPEG format. The images are then arranged in a train-to-test ratio of 80:20, and filters are applied to mitigate noise. The average time required per epoch is 7 s (Table 1).

Table 1 Variables achieved by the proposed model with different filters

S. no  Proposed model incorporating   Train accuracy  Test accuracy  Train loss  Test loss
1      No filter                      0.99            0.86           0.04        0.60
2      Gabor filter                   0.99            0.93           1.48        1.73
3      Bilateral filter               0.99            0.85           0.13        0.66
4      Non-local means filter         0.99            0.85           2.09        2.43
5      Hybrid filtering               1.00            0.85           1.99        2.57
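The block structure described above can be sketched in Keras as follows. The filter counts (32/64/128/256) and dense width are assumptions of this sketch; the chapter reports only the total parameter count, so the exact values will differ.

```python
import tensorflow as tf
from tensorflow.keras import layers

# Four convolutional blocks, a flatten layer, and two dense layers, with
# 3x3 kernels, 2x2 max pooling, ReLU hidden activations, and softmax over
# the three classes (negative, positive, uncertain).
model = tf.keras.Sequential([
    layers.Input(shape=(128, 128, 3)),
    layers.Conv2D(32, (3, 3), activation="relu"),    # block 1: Conv2D only
    layers.Conv2D(64, (3, 3), activation="relu"),    # block 2: Conv2D + pooling + BN
    layers.MaxPooling2D((2, 2)),
    layers.BatchNormalization(),
    layers.Conv2D(128, (3, 3), activation="relu"),   # block 3: Conv2D only
    layers.Conv2D(256, (3, 3), activation="relu"),   # block 4: Conv2D + BN + pooling + dropout
    layers.BatchNormalization(),
    layers.MaxPooling2D((2, 2)),
    layers.Dropout(0.25),
    layers.Flatten(),
    layers.Dense(512, activation="relu"),            # dense layer 1 (+ dropout)
    layers.Dropout(0.5),
    layers.Dense(3, activation="softmax"),           # dense layer 2: class scores
])
model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
model.summary()
```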
3.1 Algorithm
Input: Dataset containing COVID-19 CT images
Output: Classify the input as positive, negative, or uncertain
Begin
  if size(covid19_Dataset[]) ≠ Ø then
    for all images in covid19_Dataset
      JPG(covid19_Dataset[], JPG)                          // convert images to JPEG
      covid19_Dataset[] = GaborFilter(covid19_Dataset[])   // apply the Gabor filter
      Resize(covid19_Dataset[], 128, 128)
    Endloop
  Endif
  Training_covid19, Testing_covid19 ← Split(covid19_Dataset[], 80, 20)
  Shuffle(Training_covid19, Testing_covid19)
  Covid19_Dataset_CNN ← Train(Covid19_Dataset_CNN, Training_covid19)
  Evaluation ← Covid19_Dataset_CNN(Testing_covid19)
  Return positive/negative/uncertain
End
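The GaborFilter step in the algorithm above can be realized with OpenCV as in the following hedged sketch. The kernel parameters (size, sigma, theta, lambda, gamma, psi) are illustrative assumptions; the chapter does not report the values it used.

```python
import cv2
import numpy as np

def apply_gabor(image):
    # getGaborKernel(ksize, sigma, theta, lambd, gamma, psi)
    kernel = cv2.getGaborKernel((21, 21), 5.0, 0.0, 10.0, 0.5, 0.0)
    return cv2.filter2D(image, -1, kernel)  # ddepth=-1 keeps the input depth

scan = np.random.randint(0, 256, (128, 128), dtype=np.uint8)  # stand-in CT slice
filtered = apply_gabor(scan)
print(filtered.shape)
```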
4 Results and Analysis
The proposed novel CNN model for detecting the SARS-CoV-2 virus is implemented in five stages: pre-processing, classification, mitigation, implementation, and validation. The collected chest images from the COVID dataset [21] are resized to 128 × 128 pixels. Secondly, the Gabor, non-local means, and bilateral image filtering techniques are applied to the dataset. The resulting datasets are divided in the ratio of 80:20 for training and testing. The CNN model is trained with the training image data on all the datasets separately, and its performance is evaluated on the filtered and non-filtered test images separately. In Fig. 1, the proposed model shows little further reduction of model loss from epoch 60, which shows that it learns at a fast rate. The accuracy improves as the epochs increase; the attained accuracy is 0.86, the train loss is 0.04, and the test loss is 0.6. In Fig. 2, the model accuracy attained by the proposed model incorporated with the Gabor filter shows that as the epochs increase, the accuracy also increases. The accuracy when this filter is used is better than that of Fig. 1; yet in Fig. 2, we observe that the model loss stabilizes higher than in Fig. 1: the model attained a stable accuracy of 0.93, with a train loss of 1.48 and a test loss of 1.73. According to Fig. 3, the model accuracy attained when incorporating the bilateral filter is similar to that of the unfiltered model, projecting an accuracy of 0.85 with a train loss of 0.13 and a test loss of 0.66. In Fig. 4, incorporating the non-local means filter yields a similar accuracy of 0.85, with a train loss of 2.09 and a test loss of 2.43, the loss remaining high as the epochs increase; the loss stabilizes yet records high values. Figure 5 shows the hybrid filter, which is an amalgamation of the Gabor, bilateral, and non-local means filtering techniques. The amalgamation provides an outcome with an accuracy of 0.85, a train loss of 1.99, and a test loss of 2.57. The accuracy is similar to that of the bilateral and non-local means filtering techniques, yet it shows a high reduction of model loss compared with the other filter-incorporated models, similar to Fig. 4. This proves that the proposed model, when incorporated with the Gabor filter, gives better accuracy than the other filtering techniques.
Fig. 1 Model accuracy attained by the model put forward unfiltered
Fig. 2 Model accuracy attained by the model put forward incorporated with Gabor filter
Fig. 3 Model accuracy attained by the model put forward incorporating bilateral filter
Fig. 4 Model accuracy attained by the model put forward incorporated with non-local means filter
4.1 Dataset Execution
The datasets utilized in this study are accessed from [22]. The proposed model is incorporated with the Gabor filter, bilateral filter, non-local means filter, and hybrid filtering techniques, all implemented using the Kaggle platform. The execution of the model requires 13 GB of random access memory, a 16 GB graphics processing unit, and three central processing units. The datasets are divided into train and test sets in an 80:20 ratio, and the filters are then applied to the training and testing modules to determine their repercussions. The time required per epoch is 7 s.
4.2 Graphical Representation of Accuracy and Loss Attained
See Figs. 1, 2, 3 and 4.
4.3 Comparing the Model with Latest Methods
The model is studied in detail, and the effectiveness of the suggested model is compared with other methods. Table 2 exhibits the train and test accuracies and losses in detail. On this dataset, the proposed model incorporated with the Gabor filter achieves the highest train accuracy of 0.99; VGG16 attained an accuracy of 0.94 and VGG19 attained 0.92, and the accuracy attained in these models is higher than that of the Xception model, which attained an accuracy of 0.46. From the results acquired, it can be concluded that the model architecture incorporated with the Gabor filter provides comparatively better accuracy. Observation shows that in the VGG19 model, as the epochs increase, the attained accuracy also increases and the model loss decreases; the VGG16 model's accuracy likewise increases as the epochs increase, with the model loss falling correspondingly. The research conducted concludes that the Xception model is unstable and provides comparatively poor accuracy.
Table 2 Performance of the model with Gabor filter in comparison with various trained models

S. no  Model                              Train accuracy  Test accuracy  Train loss  Test loss
1      Proposed model with Gabor filter   0.99            0.93           1.48        1.73
2      VGG19                              0.92            0.91           0.22        0.24
3      VGG16                              0.94            0.91           0.19        0.25
4      Xception model                     0.46            0.59           1.02        0.85
4.4 Graphical Representation of Comparing the Model
with Related Methods
See Fig. 5.
5 Conclusion
The dataset contains approximately 5000 images in each of three categories of COVID-19 CT scan data: negative, positive, and uncertain. First, the pictures are scaled to 128 × 128 pixels in JPEG format and split in a ratio of 80:20 into train and test categories. The suggested model accommodates six convolution layers and six pooling layers with a dense layer. The convolution layers generate the feature maps, and the pooling layers summarize the features by reducing the scale of the feature maps. The filters are applied consecutively on the datasets that are classified into train and test modules. Finally, the suggested model is tested with the test set after being trained with the train set. In the proposed novel CNN model, using no filter, the attained accuracy is 0.86, the train loss is 0.04, and the test loss is 0.6, as stated in Fig. 1. Using the Gabor filter, the model attained a stable accuracy of 0.93, with a train loss of 1.48 and a test loss of 1.73, as stated in Fig. 2. With the bilateral filter, the achieved accuracy is 0.85, with a train loss of 0.13 and a test loss of 0.66, as projected in Fig. 3, while the non-local means filter achieves an accuracy of 0.85, with a train loss of 2.09 and a test loss of 2.43, as represented in Fig. 4. Implementing hybrid filtering, an accuracy of 0.85 with a train loss of 1.99 and a test loss of 2.57 was attained, as projected. On implementing the Gabor filter, comparatively finer results are obtained, and the detailed comparison between all the models can be understood from Tables 1 and 2.
References
1. Cucinotta D, Vanelli M (2020) WHO declares COVID-19 a pandemic. https://doi.org/10.23750/
abm.v91i1.9397
2. WHO (2021) WHO Coronavirus (COVID-19) dashboard. Available at https://covid19.who.
int/
3. Long C, Xu H, Shen Q, Zhang X, Fan B, Wang C, Zeng B, Li Z, Li X, Li H (2020) Diagnosis
of the coronavirus disease (COVID-19): rRT-PCR or CT? Eur J Radiol 126:108961. https://
doi.org/10.1016/j.ejrad.2020.108961
4. Kim H, Hong H, Yoo SH (2020) Diagnostic performance of CT and Reverse transcriptase
polymerase chain reaction for coronavirus disease 2019: a meta-analysis. Radiology 296:E145–
E155
5. Ai T, Yang Z, Hou H, Zhan C, Chen C, Lv W, Tao Q, Sun Z, Xia L (2020) Correlation of chest
CT and RT-PCR testing in coronavirus disease 2019 (COVID-19) in China: a report of 1014
cases. Radiology 296(2):E32–E40. https://doi.org/10.1148/radiol.2020200642
6. Kanne J (2020) Chest CT findings in 2019 novel coronavirus (2019-ncov) infections from
Wuhan, China: key points for the radiologist. Radiology 295(1):16–17. https://doi.org/10.1148/
radiol.2020200241
7. Abbas A, Abdelsamea MM, Gaber MM (2020) Classification of covid-19 in chest x-ray images
using detrac deep convolutional neural network. arXiv:2003.13815
8. Tang Z, Zhao W, Xie X, Zhong Z, Shi F, Liu J, Shen D (2020) Severity assessment of coronavirus
disease 2019 (COVID-19) using quantitative features from chest CT images. arXiv:2003.11988
9. He X, Yang X, Zhang S, Zhao J, Zhang Y, Xing E, Xie P (2020) Sample-efficient deep learning for covid-19 diagnosis based on CT scans. medRxiv
10. Ucar F, Korkmaz D (2020) COVIDiagnosis-Net: Deep Bayes-SqueezeNet based diag-
nosis of the coronavirus disease 2019 (COVID-19) from X-ray images. Med Hypotheses
140(2020):109761
11. Narin A, Kaya C, Pamuk Z (2020) Automatic detection of coronavirus disease (COVID-19) using X-ray images and deep convolutional neural networks. http://arxiv.org/abs/2003.10849
12. Zhang J, Xie Y, Li Y, Shen C, Xia Y (2020) COVID-19 screening on chest X-ray images using
deep learning based anomaly detection. [Online]. Available: http://arxiv.org/abs/2003.12338
13. Ghoshal B, Tucker A (2020) Estimating uncertainty and interpretability in deep learning for
(COVID-19) detection, pp 1–14. http://arxiv.org/abs/2003.10769
14. Alom MZ, Rahman MMS, Nasrin MS, Taha TM, Asari VK (2020) Covid MTNet: Covid-19
detection with multi-task deep learning approaches
15. Fang Y, Zhang H, Xu Y, Xie J, Pang P, Ji W (2020) CT manifestations of two cases of 2019
novel coronavirus (2019-nCoV) pneumonia. Radiology 295(1):208–209
16. Bernheim A, Mei X, Huang M et al (2020) Chest CT findings in coronavirus disease-19:
relationship to duration of infection. Radiology. https://doi.org/10.1148/radiol.2020200463
17. Li L et al (2020) Artificial intelligence distinguishes COVID-19 from community acquired
pneumonia on chest CT. Radiology. https://doi.org/10.1148/radiol.2020200905
18. Wang S, Kang B, Ma J, Zeng X, Xiao M, Guo J, Cai M, Yang J, Li Y, Meng X, Xu B (2020) A deep learning algorithm using CT images to screen for corona virus disease (COVID-19), 1–26. medRxiv preprint. https://doi.org/10.1101/2020.02.14.20023028
19. Xu X, Jiang X, Ma C, Du P, Li X, Lv S, Yu L, Chen Y, Su J, Lang G, Li Y, Zhao H, Xu K, Ruan L, Wu W (2020) Deep learning system to screen coronavirus disease 2019 pneumonia, 1–29
20. Zhou Z, Siddiquee MMR, Tajbakhsh N, Liang J (2019) UNet++: a nested U-Net architecture
for medical image segmentation. IEEE Trans Med Imaging 3–11
21. https://www.kaggle.com/datasets/andrewmvd/covid19-ct-scans
22. Sanghavi F, Panetta K, Agaian S (2021) Covid-19 detection in CT images using custom
weighted filter-based CNN. In: Multimodal image exploitation and learning 2021. International
Society for Optics and Photonics, p 117340L
23. Sen S, Saha S, Chatterjee S, Mirjalili S, Sarkar R (2021) A bi-stage feature selection approach
for Covid-19 prediction using chest CT images. Appl Intell 1–16
Software Development Estimation Cost
Using ANN
Puppala Ramya , M. Sai Mokshith, M. Abdul Rahman, and N. Nithin Sai
Abstract Several inter-related factors have an impact on the effort and productivity of software development. Since the majority of these connections are not well understood and are impossible to predict accurately, estimating software development time and effort has always been a challenging undertaking. Regression-based estimation models predominate among those in use or suggested in the literature. This study looks into the potential of artificial intelligence methods, namely case-based reasoning and artificial neural networks, for creating software development effort estimation models. When there are intricate relationships between variables, artificial neural networks are capable of providing accurate estimation. The numerous interconnected aspects involved in software development have an impact on both the development effort and its productivity, and many of these relationships are not well understood. The research examines the potential of these two artificial intelligence approaches, that is, artificial neural networks (ANNs) and case-based reasoning (CBR), for creating development effort estimation models.
Keywords Artificial neural networks · Case-based reasoning · Software development
1 Introduction
Software development estimation is a complex problem that has attracted a lot of research interest, aimed at making the estimation techniques available to project managers more useful for time management. Software development estimation involves a number of inter-related factors that affect development effort and productivity, and accurate forecasting has been challenging because these relationships are not well understood.
P. Ramya (B) · M. Sai Mokshith · M. Abdul Rahman · N. Nithin Sai
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
Vaddeswaram, India
e-mail: mothy274@kluniversity.in
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_23
Fig. 1 Software estimation cost
Regression approaches are used in the great majority of the estimation models that are in use or proposed in the literature. This research investigates the utility of two artificial intelligence methodologies, namely artificial neural networks and case-based reasoning, in building development effort estimation models. Artificial neural networks are well known for delivering accurate estimates in situations containing complex interactions between inputs and outputs, as well as input data affected by a lot of noise; these features characterize the software development environment in which development estimates are calculated. Although neural networks have tremendous potential for prediction accuracy, they lack explanation capabilities and do not directly support user adoption of outcomes. CBR discovers one or more cases that are compatible with the current problem and tries to adapt them to meet the parameters of the current situation. In software development effort estimation, each case could be a prior software development project that offers a candidate estimate for the current project; case-based reasoning can thus be used to justify judgments based on past cases that have been utilized to solve an issue (Fig. 1).
2 Estimation Model Performance
A variety of software effort estimation models have been studied by a number of researchers, and the problem has been revealed to be a complex one with unfavorable outcomes in general. Kemerer used data from projects outside of the original model development context to empirically validate four algorithmic models (SLIM, COCOMO, ESTIMACS, and FPA). Without recalibrating the models, the findings show how general these models are across different situations: the mean absolute relative error ranged from 57% to over 800% in most models, indicating a substantial overestimation bias and large estimation errors (Fig. 2).

Ferens and Gurner used projects from the Albrecht database and 14 projects from the Kemerer dataset to test development effort prediction models such as SPANS, Checkpoint, and COSTAR. The MARE ranged from 46% for the Checkpoint model to 185% for the COSTAR model, indicating high prediction error. A study by Jeffery and Low looked into the requirement for model calibration at both the industry and organizational levels; once again the MARE was high, ranging from 43 to 105% for the three companies studied, using data from 64 projects within one organization.
Fig. 2 Estimation model performance
Jeffery, Low, and Barnes compared the SPQR/20 model to FPA. To eliminate over- or underestimation bias, the models were recalibrated to the local environment; the estimation errors were far lower than in earlier investigations, with MAREs of around 12%, demonstrating the benefits of model calibration. Heemstra studied 364 firms and discovered that just 51 utilized models to estimate software development effort, and that those model users' estimates were no better than the non-model users' estimates. Heemstra also discovered that most generic models are utilized without recalibration and that most estimates are based on these generic models.
3 Artificial Neural Network Models
Artificial neural networks have the capacity to comprehend complex relationships and to approximate any measurable function, which suggests that their application success depends on adequate learning; with insufficient numbers of hidden units, the fixed relation between inputs and targets cannot be captured. They have many features that make them attractive for pattern recognition tasks and for building adaptable systems, since the network itself is a model defined by its topology and node distribution. Such a network was used to estimate the software development effort in this study (Fig. 3).
Software development involves a great number of inter-related factors that affect development effort and productivity, and because many of these working relationships are not well understood, accurate prediction of software development time and effort is challenging. Each sample also includes mean and median values, with mean values greater than median values indicating that the sample numbers are skewed toward smaller production enterprises. When dealing with issues where there is a complex link between inputs and outputs and where the input data is corrupted by high noise levels, artificial neural networks (ANNs) are known for their capacity to produce good results.
Fig. 3 Artificial neural network
These properties characterize the software development environment from which the development effort estimates are derived. Despite their good potential for predictive accuracy, neural networks lack direct user acceptance and have limited explanatory capabilities.
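As a concrete illustration of the kind of backpropagation network described in this section, the following minimal sketch trains a small feed-forward regressor to map project attributes to development effort. The synthetic data, the attribute names, and the use of scikit-learn's MLPRegressor are assumptions for illustration, not the authors' original setup.

```python
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(42)

# Hypothetical project data: size in function points plus two development
# attributes, with effort following a noisy nonlinear relationship.
n = 200
fp = rng.uniform(100, 15000, n)          # system size (function points)
team = rng.uniform(0.5, 1.5, n)          # team capability factor
complexity = rng.uniform(0.8, 2.0, n)    # product complexity factor
effort = fp * complexity / team * rng.lognormal(0.0, 0.1, n)  # person-hours

X = np.column_stack([fp, team, complexity])

# Neural networks need scaled inputs (cf. the scaling difficulty noted
# later for the wide SPQR/20 size range).
X_scaled = StandardScaler().fit_transform(X)

net = MLPRegressor(hidden_layer_sizes=(8,), max_iter=5000, random_state=0)
net.fit(X_scaled[:150], effort[:150])    # train on the first 150 projects

pred = net.predict(X_scaled[150:])       # estimate the held-out projects
mare = np.mean(np.abs(pred - effort[150:]) / effort[150:])
print(f"MARE on the held-out set: {mare:.2f}")
```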
4 Network Performance Evaluation
4.1 Performance Measurement
Various error measures have been used by researchers and metrics practitioners, but the mean absolute relative error (MARE) will be the main measure of model performance in this project; it is the preferred way of measuring errors among software researchers and is calculated as follows (Fig. 4).
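The equation itself survives in the source only as the figure; a standard formulation of MARE, consistent with the surrounding description, is:

```latex
\mathrm{MARE} = \frac{1}{N}\sum_{i=1}^{N}\frac{\lvert \hat{E}_i - E_i \rvert}{E_i}
```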
where the network output for each observation is the estimate and N is the number of observations.

Fig. 4 Performance evaluation
The mean relative error is also assessed to determine whether the model is biased toward overestimation or underestimation.
4.2 Simulated Development Data
To evaluate the development effort estimation ability of ANNs in a large, complex environment, a training set large enough to permit the network to capture the problem domain was required. A minimum number of training observations is needed to achieve satisfactory generalization, with the fraction of errors on the training set assumed to be smaller than 0.125; as a guideline, this implies that approximately 10 observations per weight are required to train the network. In this section, the neural networks are trained in an environment that includes many development attributes, which requires a rather large set of software development projects fulfilling the requirements discussed above. Suitably large datasets have been restricted to a few development attributes, so the more numerous development attributes could not be included in the model. An alternative approach was therefore employed to enable such a study, in which simulated software development project data is generated with the SPQR/20 software estimation tool. The output of SPQR/20 has been proven to be reasonably accurate in estimating development effort in a locally calibrated context, meaning that the output it produces is indicative of the development environment.
Fig. 5 Histogram of average relative errors (ARE)
All the input values for the simulated project data were generated using a random number generator.
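Since SPQR/20 is a proprietary estimation tool, the following sketch only mimics the random generation of simulated project inputs with a placeholder effort model; the ranges echo those reported below, and the productivity model is invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(7)

# Hypothetical input ranges loosely echoing those reported below
# (system size 109-15,571 function points, effort 2162-912,309 hours).
# SPQR/20 itself is proprietary, so a placeholder productivity model
# stands in for its effort estimate here.
def simulate_projects(n_projects: int):
    size_fp = rng.uniform(109, 15571, n_projects)     # function points
    productivity = rng.uniform(0.1, 1.0, n_projects)  # FP/hour (~10x spread)
    effort_hours = size_fp / productivity             # stand-in estimate
    return size_fp, productivity, effort_hours

size_fp, productivity, effort = simulate_projects(1000)
print(f"effort range: {effort.min():.0f}-{effort.max():.0f} hours")
```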
Data for 1000 projects was created and entered into SPQR/20 to estimate project development effort. There was wide variation in system size (109–15,571 function points) and in development hours (2162–912,309). Although this size range was produced, scaling was difficult because neural networks require scaled input and output values. It is quite interesting to see the productivity range that SPQR/20 generates: projects completed with the highest productivity are approximately 10 times more productive than projects completed with the lowest productivity. For the neural network to calculate an effort estimate from function point size and the main development attribute impacts, the development productivity range indicated here must be reflective of commercial development.
Figure 5 displays a histogram of average relative errors (ARE) to highlight network performance. Because the test set has 100 observations, the frequency axis also represents a frequency percentage: the frequency of estimates with an error of less than or equal to 2% is indicated in the first column, and the frequencies for 4, 6, 8 and 10% are noted in the other columns. Data from the Victorian branch of the Australian Software Metrics Association (ASMA) was also used to assess network performance (Fig. 6).
5 Conclusion
In a huge dataset of simulated project data with significantly less noise than generally occurs in real project data, ANNs were successful in properly forecasting project effort. This dataset had enough observations to allow for adequate training. The obvious expectation is that as a dataset's noise level rises, estimation errors will rise as well. The backpropagation network was able to predict development effort within 25% of actual effort for over 75% of the projects in the test set of the ASMA dataset, with a MARE of less than 0.25.

Fig. 6 ARE to show network performance

Even so, as noted by Baum and Haussler
[2] and Hinton [5], this dataset does not completely match the data needs of neural networks. The usage of CBR produced less favorable results, with 11 of 15 examples (73.3%) falling within 50% of the actual effort value and 8 of 15 (53.3%) falling within 25%. These results are optimistic, however, because only a few parameters were taken into account for adaptation, and the criteria applied were relatively narrow in scope. Artificial neural networks have demonstrated their capacity to generate an effective effort estimation model despite the limitations of the provided dataset. Although CBR looks to have promise, more research is needed to improve estimation model quality. If the current rate of project growth within the ASMA dataset continues, the database should offer a foundation for the creation of new estimation models within a few years.
Bibliography
1. Kocaguneli E, Tosun A, Bener A (2010) AI-based models for software effort estimation. In: 36th
EUROMICRO conference on software engineering and advanced applications, pp 323–326.
https://doi.org/10.1109/SEAA.2010.19
2. Verma S, Popli R, Kumar H, Tanwar R (2021) Futuristic trends in network and communication
technologies, vol 1395, p 556
3. Finnie GR, Wittig GE (1996) AI tools for software development effort estimation. In: Proceedings of the 1996 international conference on software engineering: education and practice, pp 346–353. https://doi.org/10.1109/SEEP.1996.534020
4. Estimating function points: using machine learning and regression models. In: 2nd international conference on education technology and computer (ICETC) 2010, vol 3, pp V3-52–V3-56
5. Kaushik A, Soni AK, Soni R (2012) An adaptive learning method to software cost estimation.
In: National conference on computing and communication systems (NCCCS) 2012, pp 1–6
6. Khan MJ Applications of case-based reasoning in software engineering: a systematic mapping study. IET Softw 8(6):258–268
7. Popli R, Chauhan N Cost and effort estimation in agile software development. In: International conference on reliability optimization and information technology
8. Bagga G, Bajwa MS, Rana R (2021) Recent Innov Comput 701:13
9. Pinzger M, Macho C, Beyer S, McIntosh S (2021) The nature of build changes. Empirical Softw
Eng 26
A Generic Flow of Cyber-Physical Systems—A Comprehensive Survey
Jampani Satish Babu, Gonuguntla Krishna Mohan, and N. Praveena
Abstract The overwhelming trend of modern technology is evident as conventional systems and mobile devices evolve into intelligent devices and intelligent systems. Cyber-physical systems (CPSs) have emerged as a crucial factor in the real-world environment with various essential requirements. Wireless systems and IoT-based computations are considered to attain seamless interaction between computer networks and humans, opening up diverse opportunities and challenges for modeling efficient cyber-physical systems (CPS). CPS considers human factors during system operation and real-time management. This work provides a comprehensive survey of current research that facilitates CPS. The crucial factors like development, applications, architectural design, real-time cases, standards, provisioning techniques, and networking for CPS are discussed. Here, integrated frameworks and virtualization approaches for networking, computing, and caching are examined to offer a baseline for innovative development. The performance of specific models is discussed along with their pros and cons. Finally, some research issues are discussed, and possible solutions are outlined to help young researchers direct further research.
Keywords Cyber-physical systems ·Internet of things ·Wireless system ·
Intelligent devices ·Virtualization model
1 Introduction
The drastic innovations in technological advancement have changed people's lifestyles in agriculture, industrial applications, medical treatment, transportation, and other essential areas [1]. Similarly, some newer trials are also launched by
J. S. Babu (B) · G. K. Mohan
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
Vaddeswaram, Guntur, A.P 522302, India
e-mail: jampanisatihsbabu@gmail.com
N. Praveena
VR Siddharth Engineering College, Kanuru, Vijayawada, AP, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_24
replacing conventional physical devices. Considering an information system as a representative example, it is easy to see that the traditional information system generally relies on embedded devices with limited features [2]. These systems fall short of fulfilling the essential application requirements of physical device-based control and interaction [3]. Thus, with the advancements in modern information technology, CPS is slowly turning into a mainstream path for technological progress, substituting the conventional information system [4].
Generally, CPS is an intelligent system composed of controllers, actuators, and sensors embedded in another system to assist the interaction between the cyber world and the physical world. It is composed of three diverse layers: the perception layer, the transport layer, and the application layer [5]. The first layer is utilized to attain the required sensory information and provide superior feedback decisions. The next layer includes both the physical and network layers that act as a medium access control mechanism adopted to transfer the essential information and make better decisions across diverse system elements [6]. The last layer is observed as the control layer, which is wholly adopted for making better decisions based on the broader results attained from the information perspective. This model with the layered view is extensively adopted in diverse CPSs [7]. These are explored in various real-time applications like smart grids, transportation, industries, electric vehicles, etc.
When network applications over the conventional embedded information system are intended to act more comprehensively, investigators emphasize uniting the human requirements and the networking model, thereby realizing the interaction between the network and the physical society [8]. In this context, cyber-physical social systems (CPSSs) have emerged in recent days; these are also termed association systems that merge human, physical, and computational resources. Also, a CPSS facilitates the coordination between the social, physical, and cyber worlds [9]. It assists in parallel execution, self-synchronization, and supervisory measures over the social domains, cognitive measures, and material and information domains [10]. Therefore, it is competent to offer an intellectual paradigm to attain the constructive and design goals of the intelligent environment with command and control organizations.
Contrary to CPS, CPSS considers humans as part of the system and merges the human into the loop. CPS considers some human factors via intellectual human–computer interactions during system-level functionalities [11]; thus, these factors are depicted as the superior supervision process of modern systems. In human-CPS models, humans are viewed as both service providers and service consumers, unlike in the management and operations of conventional CPS. The system has to examine an individual's competency before placing humans in the loop of execution in a given context [12]. It is essential to recognize the perspective of an individual to choose and carry out the task performance. The mainstream challenge in merging the social space of human activities is that it diverges from the computers' perspective: individuals cannot notice this sort of functionality all the time, and sometimes they do not select the appropriate instructions without proper notifications [13].
Fig. 1 Generic view of cyber-physical systems
In contrast to computerized models, it is known that individuals are more reliable; moreover, they have better competency to adapt dynamically to various environmental changes and can offer multiple innovative outcomes [14]. Thus, developers need to consider human characteristics during the optimization and construction of CPS [15], thereby facilitating the interaction among devices, computers, and individuals (users). Figure 1 depicts the generic view of CPS.
This work attempts to offer a comprehensive survey of the various prevailing models related to CPS with extensive analysis. Even with further extensive comments and reviews on intelligent systems, investigators remain interested in the study of CPS [16]. Some reviewers discuss CPS at the application level in specific fields like the industrial Internet, platoon-based vehicular models, wind energy conservation systems, smart grids, healthcare systems, and conversion systems, with reference to various prevailing research. Some researchers concentrate on specific research issues and the approaches adopted by diverse CPS applications like machine intelligence, security, and so on, thereby reviewing smart cities and intelligent healthcare systems in IoT instead of CPS. Moreover, the former models show constraints at the application level while the latter examine data-centric factors. Koscher et al. [17] discuss the recent advancements in smart grids, social manufacturing, and intelligent transportation. These sorts of reviews offer a more comprehensive investigation of CPS in a specific application field. However, the author targets the adoption of reinforcement and deep learning approaches and emphasizes that the traffic data management system is also an essential factor for discussion.
With significant variations from prevailing approaches, this article intends to discuss the integration of CPS with social communication and elaborate the constraints of specific applications. Additionally, this work concentrates on human functionality within CPS; however, the discussion is not so constructive toward the social aspects. Figure 1 depicts five diverse factors of CPS that concentrate
on development, application-based case studies, architectural design, facilitation of various modern techniques, and the networking model with research challenges.
2 CPS Evolution
The recent era is the CPS 2.0 era, differentiated by incorporating big data, cloud computing, communication, computation, control, and artificial intelligence (AI) [18]. It is referred to as CPS 1.5 or CPS 2.0, also named the human-CPS mode. Some of the characteristics of this model are listed below:
• The cyber world is constructed with the system model with control, computation, and communication networks.
• The physical world includes electrical, mechanical, and chemical processes.
• The physical side of the cyber systems is measured with actuators and sensors.
• Database servers are essential to collect and preserve the events generated by the sensors for computational control.
• Networks connected with the cyber systems like actuators and sensors are provided for communication establishment. The data captured by the sensors for physical processing is transferred to the CPS and database for processing and storage purposes. The control decisions are transferred to the actuators for performing the control actions of the commands (a minimal sketch of this sensing-to-actuation flow follows this list).
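To make the data path in the last characteristic concrete, here is a minimal, illustrative sketch of the sensing-storage-control-actuation loop; all class names, the temperature source, and the set-point are invented for illustration, not taken from the survey.

```python
import random
import time

# All names and values below are illustrative, not from the survey.
class SensorNetwork:
    def read(self) -> float:
        """Physical side measured by a sensor, e.g. a tank temperature."""
        return 20.0 + random.uniform(-5.0, 15.0)

class EventDatabase:
    def __init__(self):
        self.events = []
    def store(self, value: float):
        """Preserve sensor events for computational control."""
        self.events.append((time.time(), value))

class Actuator:
    def apply(self, command: str):
        """Perform the control action of the command."""
        print(f"actuator <- {command}")

# One pass of the sensing -> storage -> decision -> actuation loop.
sensors, db, actuator = SensorNetwork(), EventDatabase(), Actuator()
reading = sensors.read()
db.store(reading)
command = "COOL_ON" if reading > 30.0 else "COOL_OFF"  # control decision
actuator.apply(command)
```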
The widespread research and scaling of technological growth provide various application-level supports: sensors, software systems, actuators, etc. [18]. Thus, CPS is integrated with communication, computation, and control systems. The velocity, variety, and volume of data generated by CPS come with all its essential features. The conventional database servers are utilized for processing, storing, and data usage [19]. However, this is not efficiently fulfilled by CPS 1.0; cloud computing (CC) and the big data model assist it, and the result is considered the improved CPS, named CPS 2.0.
• Big data functionality is connected with CPS as the data is collected and utilized for modeling an algorithm. It examines the interactions and nature of the cyber systems to identify and eliminate abnormalities.
• Big data is used to gather environmental information and examine the system behavior to model self-reconfiguring components for self-adaptation.
• It is used to model intelligent control and service-based higher-level models for the generation of the data.
• The service-based computing nature facilitates the modeling of CPS-based applications like innovative healthcare systems, smart homes, intelligent transportation systems, etc.
• Cloud computing helps provide the management and control of various data services, interface devices, software services, and physical processes like infrastructures and ICT hardware.
Therefore, CC and big data over CPS 2.0 are determined to be the critical components to fulfill the service requirements toward the customer, system, safety, fault tolerance, adaptability, stability, and other QoS constraints [20]. Moreover, there are diverse challenging factors related to the model's reliability, security, and safety given the increased complexity and scalability, as the learning system does not validate the system process.
3 Applications
There is no authoritative and general definition for CPS; it is usually specified that CPS is a system used for controlling and monitoring the physical world environment. It is determined as the newer generation of embedded control systems, i.e., CPS-based network embedded systems. Additionally, some definitions include the actuator- and sensor-based network model over the embedded system, which CPS also considers [21]. Based on the reliance on the system model, CPS is depicted as the IT system that merges the physical world activities with the application model. It is the outcome of advancements in information and communication technologies that facilitate the interaction among physical processes [22]. These sorts of definitions project the occurrence of interactions among the physical and the cyber world problems.
The dependencies on CPS are ascending day by day in diverse applications like transportation, energy, healthcare, military, and manufacturing industries. CPS is given various names based on the applications in which it is adopted [23]. For instance, a most essential and significant CPS model provides supervisory control over the data acquisition model adopted in critical infrastructure like ICS and intelligent grids. In some cases, CPS emerges as medical devices like implantable medical devices and wearable devices. Additionally, a networking model of small control systems is embedded in smart cars to enhance safety, fuel efficiency, and convenience. This article now gives a brief representation of the CPS applications, investigated in the section given below.
(a) Smart grids. The smart grid is considered the advancement of the next-generation power grid system utilized for distribution, transmission, and electricity generation over the past few decades. It offers diverse advancements in application-level functionalities and benefits and improves global load balancing, emission control, energy savings, and intelligent generation [24]. It facilitates home consumers with superior control over energy, which is environmentally and economically beneficial at the local level. It is composed of two diverse components known as the supportive infrastructure and the power applications. The former is depicted as the intelligent component that significantly concentrates on monitoring and controlling the core smart grid operations with hardware, software, and communication networks. The latter determines the core functionality of the provided smart grid: electricity distribution, transmission, and generation.
(b) Medical devices. Medical devices are enhanced by integrating physical and cyber functionalities to provide superior healthcare services. Many researchers concentrate on the analysis of medical devices with specific cyber capabilities that have physical impact on the patients. These devices are either worn by the patients or implanted in patients' bodies; the former are termed wearable devices [25]. Generally, such a device is equipped with wireless capabilities to facilitate communication among the devices, i.e., programmers intend to reconfigure or update the devices. Similarly, the wearable devices are connected with the PCs over the control center for controlling and monitoring the functionalities from a remote location. Table 1 depicts a comparison of CPS attacks on medical devices.
(c) Industrial control systems. This specifies the control systems utilized to improve production by monitoring and controlling the systems in various industries like sewage systems, nuclear plants, and water and irrigation systems. They are also known as distributed control systems or SCADA.
Table 1 Cyber-physical attacks over medical devices

| Name | Impact | Approaches | Pre-condition |
|---|---|---|---|
| DoS | Individuals do not receive any sort of expected therapy | Re-transmit 'turn-off' commands | Captures 'turn-off' condition transferred by the programmers |
| Unauthorized command injection | Wrong decisions | Transmitting packets with false data and impersonating the CGM | Injecting communication among the pumps |
| False data injection | Crucial health conditions | Impersonate remote control by transferring packets with unauthorized commands | Injecting communication among the remote and the pumps |
| Malware injection | Frozen IPS and failures over the ON/OFF state | Communication among the BCM | Physical access toward the bus system |
| Packet injection | False injection rate, control loss, DoS, and safety measures | Compromised packets | Malware injections |
| Replay attack | Safety control measures | Re-transmission of legitimate commands and eavesdropping | Access toward the network bus |
| Car spying | Unauthorized access and theft | Captures the relay nodes and beacon signals from car to key and relays the outcome signal from key to the car | Attack requires relay tools like amplifiers and antennas |
For more reliability, ICS is discussed here with diverse controllers that collaborate to attain various expected goals. A popular controller is the programmable logic controller (PLC), a microprocessor designed to function constantly in hostile environments. It is connected with physical world elements like actuators and sensors [25]. Generally, it is equipped with wired and wireless communication abilities configured based on the surrounding region. Also, it is connected with various PC systems over the control center for monitoring and controlling the functionalities.
(d) Smart cars. These are also known as intelligent cars; they are more user- and environment-friendly with safe, fuel-efficient, improved entertainment, and valuable features [26]. These advancements are made possible with a set of 50–80 networked computers put together, known as electronic control units (ECUs). They are accountable for controlling and monitoring diverse functionalities like brake control, emission control of engines, multimedia and radio player-based entertainment, and comfort features such as window operation and cruise control mechanisms.
4 CPS Models
A CPS component has the competency to communicate with other CPS components and control centers. These components are composed of actuators and sensors connected with the physical world. Each possesses various security measures that affect the interactions of multiple elements and their corresponding capabilities [27]. For instance, the CPS components may communicate with certain computational functionalities that are not expected to influence the physical world; this can be exploited to produce unexpected characteristics with physical outcomes. However, the components' physical properties and the objects of interest in the physical world can be monitored to control random attacks and effects with non-physical outcomes like misleading information transferred over the network.
The CPS components are competent to communicate with various control centers or other elements. These components are composed of actuators or sensors that need to be connected with the physical world components. These capabilities possess diverse security implications that result in the interaction of features and capabilities [28]. For instance, the components may communicate with the computational functionalities that are not intended to influence the physical world; this is exploited through the unexpected characteristics of the physical consequences. The components' physical properties and the objects' physical properties in the physical world can control and examine the malicious attacks that cause non-physical effects like misleading data over the network. CPS heterogeneity between the available components, or a lack of understanding of the elements' functional features, results in newer kinds of security threats that exploit the model heterogeneity, and the model has to differentiate various aspects straightforwardly for security analysis. Therefore, this article proposes three diverse element classes: cyber, physical, and cyber-physical. The physical factors are
composed of components that can interact directly with the physical world, like actuators and sensors [29]. These properties may possess safety and security-based issues. The cyber-physical and cyber factors are composed of anything that does not interact directly with the outside or physical world, i.e., communication processes, computations, and monitoring activities. These two factors possess similar features; however, the critical variation relies on the interaction with the physical components.
In the CPS model, the cyber component does not directly interact with the physical components; however, the cyber-physical components do interact directly. These differences assist in offering CPS security analysis over the diverse factors, and cyber-physical aspects can connect directly with the physical and cyber world. As an example, an industrial control system controls and monitors the temperature of a chemical plant, where the objective is to maintain the temperature within a provided range. When the temperature exceeds the given threshold level, the PLC receives notification through the sensors installed on the tank and notifies the control center when the temperature varies. The PLC operates the cooling system to bring the tank's temperature back into the desired range. The cyber interaction with the PLC involves no direct interaction with the physical components like the tank and cooling fans; it includes direct connection with laptops and communication with higher-end environments like remote entities and control centers. The wireless interface is based on short- and long-range frequencies [30]. In smart grids, smart meters are attached to the utilities, measure the electricity consumption, and monitor usage information. A meter is an interface between the energy management system and the house appliances; here, wireless is the standard means of communication. The meter is equipped with a short-range interface used by diagnostic tools and digital meter readers. The collector can transfer the aggregated data over the designated neighborhood and manage it for the utility companies. Specifically, the data is transmitted to the servers, which maintain the data and share it among the management systems. This is determined to be an efficient way to establish control, but it could be exploited to launch blackouts over a number of smart meters.
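As a toy rendering of the threshold logic just described for the chemical tank, the following sketch shows one possible PLC scan cycle with hysteresis; the set-points and function names are hypothetical, and a real PLC would be programmed in an IEC 61131-3 language rather than Python.

```python
# Hypothetical set-points for the chemical-tank example; not from the source.
THRESHOLD_C = 80.0   # notify and start cooling above this temperature
TARGET_C = 60.0      # stop cooling once back in the desired range

def notify_control_center(message: str):
    print(f"[control center] {message}")

def plc_step(tank_temp_c: float, cooling_on: bool) -> bool:
    """One PLC scan cycle: returns the new cooling-system state."""
    if tank_temp_c > THRESHOLD_C:
        notify_control_center(f"temperature high: {tank_temp_c:.1f} C")
        return True                 # engage the cooling system
    if cooling_on and tank_temp_c <= TARGET_C:
        return False                # temperature back in range
    return cooling_on               # hysteresis: keep the current state

# Example scan cycles
state = False
for temp in (75.0, 85.0, 70.0, 58.0):
    state = plc_step(temp, state)
    print(f"temp={temp:.1f} C -> cooling {'ON' if state else 'OFF'}")
```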
In medical devices, the implanted insulin pump manually or automatically injects insulin for diabetes patients, while an implanted defibrillator is adopted to detect the heartbeat and respond to preserve the heart rate. The insulin pump works with a continuous glucose monitor (CGM) that measures the blood sugar level; the glucose level received from the CGM is transferred through wireless signals to other devices. The cyber embedded systems are directly connected with the system in the hospital to provide wireless communication. They can interact directly with the implanted devices and represent the cyber aspects of the medical devices. In smart cars, communication is effected through the ECUs, which are connected to a sub-network model. A car possesses various sub-networks, with gateways establishing inter-communication among them. Security concerns center on the CANs deployed over the network.
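Several of the attacks in Table 1 (false-data injection, replay) hinge on unauthenticated wireless packets between the CGM and the pump. The sketch below shows one common mitigation pattern, HMAC authentication plus a freshness check; the shared key, message format, and timestamp-based freshness are illustrative simplifications, since real devices would use a vetted pairing protocol and nonces or counters.

```python
import hashlib
import hmac
import json
import time

# Hypothetical shared key provisioned between CGM and pump; real devices
# would derive keys via a vetted pairing protocol, not a hard-coded value.
SHARED_KEY = b"example-only-key"

def send_reading(glucose_mg_dl: float) -> bytes:
    msg = json.dumps({"glucose": glucose_mg_dl, "ts": time.time()}).encode()
    tag = hmac.new(SHARED_KEY, msg, hashlib.sha256).hexdigest().encode()
    return msg + b"|" + tag

def receive_reading(packet: bytes, max_age_s: float = 5.0) -> float:
    msg, tag = packet.rsplit(b"|", 1)
    expected = hmac.new(SHARED_KEY, msg, hashlib.sha256).hexdigest().encode()
    # Reject forged packets (false-data injection) ...
    if not hmac.compare_digest(tag, expected):
        raise ValueError("authentication failed")
    payload = json.loads(msg)
    # ... and stale retransmissions (replay attacks).
    if time.time() - payload["ts"] > max_age_s:
        raise ValueError("stale packet rejected")
    return payload["glucose"]

print(receive_reading(send_reading(112.0)))
```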
5 Security Models
With a broader relationship established among the computational and physical infrastructure, the system needs to provide essential security measures. The diversification and complexity among the physical and cyber components cause the system to be more vulnerable by provisioning enormous threats. Interruption of the physical infrastructure, caused by bad weather conditions or a crisis in war conditions, can endanger human lives. Some attacks target the communication environment and influence the system by maliciously capturing sensitive information. These sorts of attacks are harmful and intend to control the IT system through communication disruption and termination of system activity. These attack types can influence individuals in the physical environment and tamper with the interaction among systems [31]. Data analysis and system monitoring are essential for the evaluation of threats and performance. The system can analyze the utility and monitor the cyber side to observe issues in the privacy violation process and proprietary model. For instance, consider the power distribution problem over intelligent grids and the power consumption analysis over industrial and residential areas, which includes the classification of household appliances, time of usage, and energy consumption [31]. While this does not influence the infrastructure functionality, such information is valuable content regarding a specific property and can predict when a property is empty and vulnerable to theft.
Another instance of CPS security relies on unmanned vehicles that use cameras to record the region and construct the terrain with appropriate functionality. The captured footage, determined to be surveillance material, can be leaked, exposing the footage [32]. In the case of implantable medical devices, the information gathered by the systems is composed of diagnosis measures, model identifiers, therapy regimens, and so on. If this information is not preserved safely, attackers can use it to target individuals. Thus, the significant objective is to offer a privacy measure within CPS's security policies, and those systems need to provide reliable and safe functionality to achieve security. Table 2 depicts the CPS vulnerabilities.
(a) Attacks on actuators and sensors. Papadimitriou et al. [33] discuss the mitigation problem and the intrusion detection process over the control system based on SI, AE, and SE attacks. The attackers influence the sensors, scrub the sensor observations, and inject false statements, leading the system to move into an unsafe environment. Finite-state automata for the attack classes are detected, and defending mechanisms are provided for online attacks, measuring the control events after the detection process. An algorithmic model is required to validate the system environment and protect it from attacks, where the damage is modeled based on pre-defined reachability of unsafe system states. Necessary and sufficient conditions are needed to fulfill the system requirements and eliminate the damage caused by the attacks mentioned above. Papadimitriou et al. [33] also discuss a defending mechanism that eliminates the network attacks over the actuators and sensors.
Table 2 CPS vulnerabilities

| S. No. | Cyber-physical systems | Vulnerabilities |
|---|---|---|
| 1 | Industrial control systems | 1. Web-based attacks 2. Wireless and wired communications 3. Insecure protocols 4. Insecure access points 5. Equipped physical storage 6. Interconnection field 7. Open communication protocols 8. Software measures |
| 2 | Smart grids | 1. Communication protocols 2. Software 3. Customer's privacy invasion 4. Interconnected field devices 5. Insecure smart meters and protocols 6. Blackouts 7. Physical sabotage |
| 3 | Wearable devices | 1. DoS 2. Software 3. Noise and jamming 4. Injection and replay attacks 5. Privacy invasion |
| 4 | Smart cars | 1. GPS traceability 2. Replay attacks 3. Easier interception 4. Communication flaws in software 5. Player exploitations 6. Unprotected components 7. Authentication flaws 8. Insecure bus system |
When the system is not identified as under attack, the defending mechanism does not vary the characteristics of the closed-loop system. The author introduces detectable and undetectable network attacks to validate the properties and provides the necessary and sufficient conditions for predicting the attacks. It is essential to ensure the countermeasures of the control system and provide specific requirements for detectable and undetectable security systems. Figure 2 depicts the CPS attack surface.
Fig. 2 CPS attack surface
Kim et al. [34] discuss CPS under attack as a descriptor system with certain constraints over the unknown inputs. An attack influences the measurement state, and the model is established for attack detection and for recognizing the consequences of the attack on the output measurements. The constraints over the class labels are monitored with graph theory and system theory. The significant role of the system is to examine the defects caused by physical attacks and monitor the signals that trigger the system dynamics. Danezis et al. [35] discuss modeling a powerful attack on an uncertain CPS devoid of any attacks. The zero-dynamics attacks are provided with a standard representation; this is adopted for uncertain systems, and alternative methods are offered for providing a perfect system. The robust attack model requires the nominal plant as input and examines the output signals. As stated by Danezis et al. [35], a man-in-the-middle attack is a crucial attack on the CPS model: the intruder senses, creates, hides, or varies the information over the sensor channel and manages the communication channel. Danezis et al. [35] construct a deterministic model over the sensor and actuator channels with defense mechanisms to protect the system from damage. The safety control measure is determined over the network attacks; it is termed a safe-controllability measure, predicts the attack over the network, and stops the system from reaching the unsafe environment. This model is provided to validate the attributes, and some computational devices are modeled to predict the dangerous environment, known as the intrusion detection model. Some mathematical models like Bernoulli, queuing, and Markov models are used for examining the CPS performance when the system encounters DoS attacks.
Gupta et al. [36] discuss the risk-sensitive problem of a DoS attack under Markov modeling. The attacker uses the Markov model to measure the system-based control packets. Gupta et al. [36] discuss the probability measure to examine the stochastic properties, and the hidden Markov model is provided to evaluate the risk-sensitive control factors. The author discusses the consequences of DoS attacks. The performance is measured through the linear-quadratic model to reduce the cost function of the system in the attack environment and offers a novel solution with risk-sensitive programming. Gupta et al. [36] discuss the DoS attack model with certain constraints obtained by restricting the frequency of the DoS attack. It is possible to capture diverse DoS attacks, including random, periodic, trivial, and protocol-based jamming attacks. The robust control measure over the DoS attack is examined and intends to reduce the attack frequencies without damaging stability. The author discusses a dynamical event-triggered control mechanism to override the DoS attacks. When the advanced controllers intend to exchange information, the attack measure examines the information transmission and predicts the vulnerabilities. When a vulnerability is identified, the system is intruded on by DoS attacks and moves to an unsafe environment. A fault-riding mechanism needs to be designed, and the significance of the model should be analyzed.
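As a toy illustration of the Bernoulli view of DoS mentioned above, the following sketch simulates a scalar control loop whose actuation packets are dropped independently with probability p and reports the resulting average quadratic cost; the plant, controller gain, and noise level are invented for illustration, not taken from the cited work.

```python
import random

def simulate(p_drop: float, steps: int = 1000, seed: int = 0) -> float:
    """Scalar plant x' = a*x + b*u under Bernoulli control-packet drops."""
    rng = random.Random(seed)
    a, b, k = 1.05, 1.0, 0.9       # slightly unstable plant, state feedback
    x, cost = 1.0, 0.0
    for _ in range(steps):
        u = -k * x                 # controller's intended actuation
        if rng.random() < p_drop:  # DoS: control packet lost
            u = 0.0                # actuator applies no input
        x = a * x + b * u + rng.gauss(0.0, 0.01)
        cost += x * x              # accumulate quadratic state cost
    return cost / steps

for p in (0.0, 0.2, 0.5):
    print(f"drop prob {p:.1f}: average cost {simulate(p):.3f}")
```

The average cost grows with the drop probability, which is the qualitative point of the Bernoulli DoS models surveyed here.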
6 CPS Properties
In general, CPS functions in diverse environments to accomplish various purposes. The functionality needs to fulfill both the physical and cyber securities. There are three non-functional requirements related to the higher-level perspective, i.e., availability, security, and safety. These three properties are achieved with complex multi-disciplinary systems with objective challenges and need holistic consideration. CPS possesses a diverging nature due to the integration of physical and cyber components and the dynamical interaction established among the physical and CPS environment [37]. The constant variations cause aggressive variation over these three properties, and a better trade-off among them must be established. The researchers aim to offer security with related properties. These properties are listed below:
(1) Safety. CPS operation is based on interaction with the dynamic physical environment and influences human lives. An essential requirement of CPS is to fulfill the individual's safety during each specific operation. CPS provides intelligent context to make a better decision when physical constituents influence the systems. Concerning safety measures, the CPS functionality must be based on the probable scenario and provide output decisions that do not harm human lives. For instance, in smart grids, the CPS should check the power and ensure no voltage fluctuations. Similarly, in unmanned vehicles, malfunctioning needs to be measured, and when the system encounters safety violations, they have to be resolved with proper safety policies.
(2) Security. Both cyber and physical security need to be considered: the protection of the system components against theft, unauthorized tampering, and weather conditions, as well as information integrity and confidentiality of information access, need to be addressed. The equipment of the intelligent power grids located outdoors needs to be installed with weatherproofing. The protocols and policies are determined with operational functionalities, and information monitoring is accessed in an authorized manner [37]. Failures with catastrophic outcomes need to be treated as critical situations. Attacks on an insulin pump whose control is captured by adversaries need to be identified as malicious activities, as they can cause the death of individuals. The confidentiality of sensitive information associated with the physical components needs to be preserved. Information available to malicious entities can be exploited for discrimination, blackmailing, identity theft, etc. In some cases, unauthorized persons acquire the information, and the sensitive information is stolen. The primary security feature of CPS concerns cyberattacks with physical consequences. The security model deals with cyber threats and considers both the critical physical and cyber system processes.
(3) Availability. CPS is generally utilized for offering crucial functionalities where operation must continue uninterrupted for a long time. However, there should be a proper balance between the energy/power needed for the computational process and what the system consumes [37]. Downtimes of the critical systems are not acceptable, and vulnerabilities should be eliminated where possible. Patches can trigger the functionality and tamper with operations; due to the availability requirements of the system, the risk factors over the system may remain un-patched.
In CPS, the security trade-off is established based on the system's physical environment and application criticality. The significant objectives of the system are achieved with the operational requirements in priority order. The security concept is demonstrated with the system operation in a complex and critical environment.
7 Challenges in CPS
The development and construction of CPS are generally a complex engineering process with enormous challenges. Some challenges have not been resolved and handled by researchers [38]. Those challenges are summarized here for analysis, and some ideas are given for addressing the issues. Models play a substantial role in the CPS-based process. The heterogeneity is complex, with diverse challenges:

• Interoperability of various software systems modeled by multiple people with diverse technological paradigms, i.e., modeling languages, tools, and theories.
• Spatial-temporal, mobility, and distribution models.
• Synchronization, concurrency, and interaction among the cyber and physical systems.
Validation issues. CPS applications are mission-critical processes and require safety measures for real-time operation, security, concurrency, robustness, and fault tolerance. The significant factors related to validation issues are:

• Large-scale systems with complex nature, i.e., continuous or discrete state.
• Massive security threats over the operational environment and security vulnerabilities over the cyber systems, network, and communication process.
• The uncertain environment and complex system execution are determined using machine learning approaches. Thus, efficient models or tools are required to establish effectual verification and validation, fault tolerance, robustness, security, and so on.

Evolution issues. The construction of the CPS system cannot be done from scratch. It is initiated with simple modeling and evolves further. The requirements and application domain are dynamically changing. Thus, the CPS evolves into a newer and open system model in constant development, and the prevailing models intend to connect themselves with the CPS. The integration of dynamical systems over the CPS is a complex task.

Issues over QoS fulfillment. CPS with various requirements possesses various QoS requirements like maintainability, extensibility, availability, reliability, and certain other factors. Adaptability, reconfigurability, and evolution deal with internal and external uncertainty based on CPS requirements. Additionally, interoperability is essential, and some complex applications include a multi-domain scenario like the smart-health CPS model, pilot systems, healthcare systems, and traffic management. Predictability is demanded to fulfill the CPS outcomes with essential requirements, as not all components are predictable, e.g., data-driven components. QoS constraints and attributes are modeled for verification and implementation over diverse CPS applications.
8 Model-Driven Conceptual Approaches
There are enormous techniques, theories, and tools for model-driven (MD) approaches, including MD engineering, MD testing, MD architectural models, and MD system engineering. These are used for handling various practical issues and include two concepts: model transformation and modeling [38]. People construct the modeling phase to examine the subjects and real-world objects and project the system characteristics. The characteristics of the modeling phase are given below:

• Abstraction: concentration on the specific model with the elimination of irrelevant models.
• Purposeful construction of the particular set of concerns toward the stakeholders.
• Recognized level of expression to understand the user's requirements.
• Accurately provisioning the model significance.
• Essential to answer questions about the system model.
• Faster and more cost-efficient than construction of the existing system.
Based on the available characteristics, the models are partitioned into diverse categories:

• Construction of the system model satisfying the prescriptive and descriptive levels.
• Inclusion of mathematical models like graph theory and so on.
• Categorization into the system model, meta-model, and meta-meta-model.
• Precision is essential, with three diverse levels: implementation, conceptual, and specification models.
The construction of a qualified model is essential; it needs an effectual modeling language and an efficient system design. The modeling language specifies the structure for building models. Various languages like BPMN, AADL, and UML have been proposed. Modeling languages are partitioned into two extensive groups: domain-specific and general-purpose modeling languages. The former targets supporting certain domains, while the general-purpose modeling language comprises a comprehensive range of modeling abilities [38]. The domain-specific models are used as the research trend for the construction of the model. The modeling languages are composed of three diverse components, abstract syntax, concrete syntax, and semantics, which play a substantial role in model construction. Various individuals involve themselves in the construction, large quality differences arise, and so the process needs guidance (Table 3).
The significant role of model transformation is to achieve automation with model-driven approaches [38]. Transformations produce a target model and can be classified in diverse forms:
Table 3 Comparison of various modeling parameters

| Categories | Properties | Remarks |
|---|---|---|
| Language-based properties (nature of computation) | 1. Termination criteria 2. Elimination of execution semantics 3. Typing | 1. Existence of target model 2. Unique target modeling 3. Consistent rule-transformation 4. Concentrating on transformation language |
| Transformation-based properties (nature of modeling) | 1. Model typing and conformance 2. Transformation properties 3. Syntax relationship 4. Semantic relationship | 1. Conforms target meta-modeling 2. Set of the source model 3. Dynamical consistency 4. Mathematical modeling |
| Testing model | 1. Meta-model coverage 2. Direct verification 3. Manual modeling of graph theories 4. State-space explosion mitigation like partial order reduction | 1. Challenge in defining test cases 2. Component verification 3. Transformation process 4. Verification transformation and transformation process 5. Certification by model verification |
(1) Based on the source and target models, transformations are classified into model-to-model, text-to-model, and model-to-text formats. (2) Based on the abstraction level of the source and target models, transformations are classified as vertical or horizontal. Most of the present transformation models are based on meta-model theory only [39]. The adaptability and scalability of the domain-specific model are provided in a constrained manner.
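As a tiny illustration of the model-to-text form of transformation mentioned above, the sketch below turns a dictionary-based "model" of a component into generated code; the meta-model schema and all names are invented for illustration.

```python
# Hypothetical meta-model instance: a component with typed ports.
model = {
    "name": "TemperatureMonitor",
    "ports": [
        {"name": "temp_in", "type": "float", "direction": "in"},
        {"name": "alarm_out", "type": "bool", "direction": "out"},
    ],
}

def model_to_text(component: dict) -> str:
    """Model-to-text transformation: emit a Python class skeleton."""
    lines = [f"class {component['name']}:"]
    for port in component["ports"]:
        lines.append(
            f"    {port['name']}: {port['type']}  # {port['direction']} port"
        )
    return "\n".join(lines)

print(model_to_text(model))
```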
9 Conclusion
This work provides an extensive survey of privacy and security measures over CPS with a specific concentration on various applications like medical devices, smart grids, ICS, and smart cars. A detailed taxonomy is presented to identify the vulnerabilities, threats, control methods, and known attacks. Here, a security framework is integrated with various security and CPS factors. The model projects how attacks over the physical domains can produce unexpected consequences over the cyber-physical fields, along with better solutions. Some control mechanisms are modeled to avoid cyber-physical attacks. For instance, the heterogeneity of the model is identified with various attacks; thus, an efficient solution needs to be provided for heterogeneous component interaction. The research-based security measures need to be kept active with frequent reports on cyberattacks. Moreover, some defense mechanisms need to be deployed with system-specific solutions to predict vulnerabilities and threats. This survey highlights the challenges over the security mechanisms and hopes that young researchers will handle all these challenges.
References
1. Amin, Schwartz GA, Hussain A (2013) In quest of benchmarking security risks to cyber-
physical systems. IEEE Netw 27(1):19–24
2. Amin, Litrico X, Sastry SS, Bayen AM (2010) Stealthy deception attacks on water SCADA
systems. In: Proceedings of the 13th ACM international conference on Hybrid systems:
computation and control, Stockholm, Sweden, pp 161–170
3. Arias, Wurm J, Hoang K, Jin Y (2015) Privacy and security in internet of things and wearable
devices. IEEE Trans Multi-Scale Comput Syst 1(2):99–109
4. Cao, Zhu P, Lu X, Gurtov A (2013) A layered encryption mechanism for networked critical
infrastructures. IEEE Netw 27(1):12–18
5. Cheminod, Durante L, Valenzano A (2013) Review of security issues in industrial networks.
IEEE Trans Ind Informat 9(1):277–293
Cho, Shin KG (2016) Fingerprinting electronic control units for vehicle intrusion detection. In: Proceedings of 25th USENIX Security Symposium, Austin, TX, USA, pp 911–927
7. Choi, Kim H, Won D, Kim S (2009) Advanced key-management architecture for secure SCADA
communications. IEEE Trans Power Del 24(3):1154–1163
8. Dardanelli A et al (2013) A security layer for smartphone-to-vehicle communication over
bluetooth. IEEE Embed Syst Lett 5(3):34–37. https://doi.org/10.1109/LES.2013.2264594
A Generic Flow of Cyber-Physical systems—A Comprehensive Survey 239
9. Ericsson (2010) Cyber security and power system communication—essential parts of a smart grid infrastructure. IEEE Trans Power Del 25(3):1501–1507
10. Fovino, Carcano A, Masera M, Trombetta A (2009) Design and implementation of a secure modbus protocol. In: Critical infrastructure protection III. Springer, Berlin, Germany, pp 83–96
11. Francia III, Thornton D, Brookshire T (2012) Wireless vulnerability of SCADA systems. In:
Proceedings of the 50th Annual Southeast Regional Conference, Tuscaloosa, AL, USA, pp
331–332
12. Garcia, Oswald D, Kasper T, Pavlidès P (2016) Lock it and still lose it–on the (in) secu-
rity of automotive remote keyless entry systems. In: Proceedings of 25th USENIX Security
Symposium (USENIX Security), Austin, TX, USA, pp 929–944
13. Gollmann (2013) Security for cyber-physical systems. In: Mathematical and engineering
methods in computer science. Berlin, Germany, Springer, pp 12–14
14. Hayajneh, Mohd BJ, Imran M, Almashaqbeh G, Vasilakos AV (2016) Secure authentication
for remote patient monitoring with wireless medical sensor networks. Sensors 16(4):424
15. Jo, Choi W, Na SY, Woo S, Lee DH (2017) Vulnerabilities of android OS-based telematics
system. Wireless Pers Commun 92(4):1511–1530
16. Johnson (2010) Survey of SCADA security challenges and potential attack vectors. In: Proceedings of IEEE international conference on internet technology and secured transactions (ICITST), London, UK, pp 1–5
17. Koscher et al (2010) Experimental security analysis of a modern automobile. In: Proceedings
of IEEE symposium on security and privacy (SP), Oakland, CA, USA, pp 447–462
18. Lu, Lu X, Wang W, Wang C (2010) Review and evaluation of security threats on the communi-
cation networks in the smart grid. In: Proceedings of Milcom 2010 Military Communications
Conference, San Jose, CA, USA, pp 1830–1835
19. Mitchell, Chen IR (2013) Behavior-rule based intrusion detection systems for safety critical
smart grid applications. IEEE Trans Smart Grid 4(3):1254–1263
20. Oberoi et al (2016) Wearable security: key derivation for body area sensor networks based on
host movement. In: Proceedings of international industrial electronics (ISIE), pp 1116–1121
21. Alshemali, Kalita J (2019) Toward mitigating adversarial texts. Int J Comput Appl
22. Behjati, Moosavi-Dezfooli S-M, Baghshah MS, Frossard P (2019) Universal adversarial attacks on text classifiers. In: Proceedings of IEEE ICASSP
23. Oregi, Del Ser J, Perez A, Lozano JA (2018) Adversarial sample crafting for time series
classification with elastic similarity measures. In Proceedings of IDC
24. Ravi, Wong C, Lo B, Yang GZ (2016) A deep learning approach to on-node sensor data analytics
for mobile or wearable devices. IEEE J Biomed Health Inf
25. Ren, Deng Y, He K, Che W (2019) Generating natural language adversarial examples through
probability weighted word saliency. In Proceedings of ACL
26. Zhu, Xiong P, Li G, Zhou W, Philip SY (2018) Differentially private model publishing in cyber
physical systems. Future Gener Comput Syst, in Print
27. Giraldo, Sarkar E, Cardenas AA, Maniatakos M, Kantarcioglu M (2017) Security and privacy
in cyber-physical systems: a survey of surveys. IEEE Design Test 34(4):7–17
28. Gowtham, Ahila SS (2017) Privacy enhanced data communication protocol for wireless
body area network. In: 4th IEEE international conference on advanced computing and
communication systems (ICACCS), pp 1–5
29. Lu, Zhu H, Liu X, Liu JK, Shao J (2014) Toward efficient and privacy-preserving computing
in big data era. IEEE Netw 28(4):46–50
30. Shim (2016) A survey of public-key cryptographic primitives in wireless sensor networks.
IEEE Commun Surv Tutorials 18(1):577–601
31. Jain, Gyanchandani M, Khare N (2018) Differential privacy: its technological prescriptive
using big data. J Big Data 5(1):15
32. Lv, Zhu S (2019) Achieving correlated differential privacy of big data publication. Comput
Secur 82:184–195
33. Papadimitriou, Li F, Kollios G, Yu PS (2007) Time series compressibility and privacy. In:
Proceedings of the 33rd international conference on Very large data bases. VLDB Endowment,
pp 459–470
34. Kim, Kumar PR (2012) Cyber–physical systems: a perspective at the centennial. In: Proceed-
ings of the IEEE, 100(Special Centennial Issue):1287–1308
35. Danezis, Fournet C, Kohlweiss M, Zanella-Béguelin S (2013) Smart meter aggregation via
secret-sharing. In: Proceedings of the first ACM workshop on Smart energy grid security, pp
75–80
36. Gupta (2012) Implantable medical devices-cyber risks and mitigation approaches. In: Proceed-
ings of cybersecurity on cyber-physics and system workshop (NISTIR), pp 15–30
37. Joung (2013) Development of implantable medical devices: From an engineering perspective.
Int Neurourol J 17(3):98–106
38. Rasmussen, Castelluccia C, Heydt-Benjamin TS, Capkun S (2009) Proximity-based access
control for implantable medical devices. In: Proceedings on 16th ACM conference on computer
and communications security (CCS), pp 410–419
39. Rostami, Juels A, Koushanfar F (2013) Heart-to-heart (H2H): authentication for implanted
medical devices. In: Proceedings on ACM SIGSAC conference computer communication
security, pp 1099–1112
Mental Disorder Detection in Social
Networks Using SVM Classification:
An Improvised Approach
B. Dinesh Reddy, Eali Stephen Neal Joshua, N. Thirupathi Rao,
and Debnath Bhattacharyya
Abstract Online social networking has caused profound changes in the way people communicate and interact. These changes may affect certain normal aspects of human behavior and cause psychiatric disorders. Mental illness is rapidly becoming one of the most prevalent public health problems around the world. Social media networks, where users can express their emotions, feelings, and thoughts, are a valuable source of data for analyzing mental health, and procedures based on machine intelligence are increasingly used for this purpose. It is difficult to detect social network mental disorders (SNMDs) because the mental factors considered in existing diagnostic criteria (questionnaires) cannot be observed from online activity logs. Automatically detecting SNMD cases among OSN users under these constraints, by evaluating users' online psychological states, is extremely challenging. For instance, the degree of isolation and the effect of reduced inhibition among OSN users are not easily discernible. Keywords associated with abnormal activity are generated and stored on a server. Each user activity (tweets, posts, comments, etc.) is stored in a database that can be used to analyze mental disorders; this helps to monitor user activities in the social network. The proposed work detects severe types of SNMDs with a binary SVM classification approach.
Keywords Mental disorder · Data mining · Online social network · SVM · SNMD
B. D. Reddy (B) · E. S. N. Joshua · N. T. Rao
Department of Computer Science and Engineering, Vignan’s Institute of Information Technology,
Visakhapatnam, AP 530016, India
e-mail: dinesh4net@gmail.com
D. Bhattacharyya
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
Vaddeswaram, Guntur, AP 522302, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_25
Fig. 1 Showing the proposed data flow design for decision-making
1 Introduction
The primary objective of the data mining technique [1] is to extract information from a data set and transform it into an understandable structure for further use. Beyond the raw analysis step, it involves data and knowledge management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, and post-processing of discovered structures. Databases, data warehouses, the Web, and data files with other documents are the typical sources of data. A large amount of existing data is required for data mining to succeed. Institutions typically store data in databases or data warehouses. Data warehouses may contain one or more data files, worksheets, or other forms of data repositories (Fig. 1).
In some cases, data may even reside in plain data files or worksheets. The Internet, or World Wide Web, is another massive source of data.
2 Literature Survey
It has been shown that these representations can easily be added to existing models and significantly improve the state of the art across six challenging NLP problems, including question answering [24], textual entailment, and sentiment analysis.
An analysis is also presented revealing that exposing the deep internals of the pre-trained network is crucial, allowing downstream models to mix different types of semi-supervision signals. Combining internal states in this way allows for very rich word representations. Using intrinsic evaluations, it is shown that the higher-level LSTM [5] states capture context-dependent aspects of word meaning (for example, they can be used without modification to perform well on supervised word-sense disambiguation tasks), whereas the lower-level states model aspects of syntax (for example, they can be used for POS tagging).
At the same time, exposing all of these signals is highly beneficial, allowing the learned models to select the types of semi-supervision that are most helpful for each end task. Since adding ELMo improves task performance over word vectors alone, the biLM's contextual representations must encode information that is generally useful for NLP tasks and that is not captured in word vectors.
Universal language model fine-tuning (ULMFiT) [6] is an effective transfer learning method that can be applied to any task in NLP, together with techniques that are key for fine-tuning a language model. No matter how diverse the general-domain data used for pre-training is, the data of the target task will likely come from a different distribution. The language model (LM) is therefore fine-tuned [7] on the data of the target task. Given a pre-trained general-domain LM, this stage converges faster, as it only has to adapt to the idiosyncrasies of the target data, and it allows a robust LM to be trained even for small data sets. To adapt its parameters to task-specific features, we would like the model to quickly converge to a suitable region of the parameter space at the beginning of training and then refine its parameters. Using [8] a constant learning rate, or an annealed learning rate, throughout training is not the best way to achieve this behavior. Instead, slanted triangular learning rates (STLR) are proposed, which first linearly increase the learning rate and then linearly decay it according to an update schedule, as sketched below.
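The following is a minimal sketch of such a slanted triangular schedule, following the parameterization commonly given for STLR (peak learning rate lr_max, warm-up fraction cut_frac, decay ratio); the specific values used here are illustrative assumptions, not taken from this chapter.

```python
def stlr(t, T, cut_frac=0.1, ratio=32, lr_max=0.01):
    """Slanted triangular learning rate at iteration t of T:
    a short linear warm-up followed by a long linear decay."""
    cut = int(T * cut_frac)                 # iteration at which the peak is hit
    if t < cut:
        p = t / cut                         # rising phase: fraction of warm-up done
    else:
        p = 1 - (t - cut) / (cut * (1 / cut_frac - 1))  # falling phase
    return lr_max * (1 + p * (ratio - 1)) / ratio

# Example: the rate rises until iteration 100, then decays toward lr_max/ratio.
for t in (0, 50, 100, 500, 999):
    print(t, round(stlr(t, T=1000), 5))
```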
3 Existing System
A social media service (also called a social networking site) is a Web platform used by people to build social networks or social relations with other people. The variety of stand-alone and built-in social media services currently available online introduces challenges of definition; however, there are some common features: (1) social media services are Internet-based applications; (2) user-generated content (UGC) is the lifeblood of SNS organisms; (3) users create service-specific profiles for the site or applications that are designed and maintained by the SNS organization; and (4) social media services facilitate the development of online social networks by connecting a user's profile with those of other individuals and groups. Most social media services are Web-based and provide means for users to interact online, such as by
e-mail (e.g., Gmail), instant messaging apps, and online sites. Social media applications vary, and they incorporate a range of new information and communication tools, available on personal computers/desktops and laptops and on mobile devices such as tablets. Online community services are sometimes considered a social media service, though in a broader sense a social media service usually means an individual-centered service, whereas online community services are group-centered. Social media sites enable users to share ideas, digital photos and videos, and posts, and to inform others about online or real-world activities and events with people in their network. While face-to-face social networking, such as gathering in a village market to talk about events, has existed since the earliest development of towns, the Web enables people to connect with others who live in totally different locations, ranging from across a town to across the earth. Depending on the social media Web page, members may be able to contact any other member. In other situations, members can contact anyone they have a connection with, and subsequently anyone that contact has a connection with, and so on. LinkedIn, a career-oriented social media service, generally requires that a member personally know another member in real life before they connect online. Certain services require members to have a pre-existing connection in order to contact other members.
4 Proposed System
The main types of social media services are those that contain category places (such as former school year or classmates), means to connect with friends (usually with self-described pages), and a recommender system linked to trust. Social media services can be divided into three types: socializing social media services, used primarily for socializing with existing friends (example); networking social media services, used primarily for non-social interpersonal communication (example, LinkedIn, a career- and employment-oriented site); and social navigation social media services, used primarily for helping users to find specific information or resources. Attempts are being made to standardize these services to avoid the need for duplicate entries of friends and interests. People are increasingly using social media sites, such as Twitter and Facebook, to share their comments and views with their connections. Posts shared on these sites are embedded in representative connections, within the flow of daily actions and events. In this way, social media offers a means of capturing behavioral attributes that relate to an individual's thinking, mood, communication, activities, and socialization. The emotion and language used in social network posts may reflect the feelings of worthlessness, guilt, helplessness, and self-hatred that characterize major depression. In addition, depression sufferers often withdraw from social situations and activities. Such changes in behavior may be noticeable as changes in activity on social networks. Also, Web pages may mirror ever-changing social interactions. We
Fig. 2 Showing the system architecture of the proposed model
follow the hypothesis that changes in vocabulary, activity, and social ties can be used jointly to develop statistical models to detect and even predict major depressive disorder in a fine-grained way, along with approaches that can complement and extend traditional methods of diagnosis (Fig. 2).
SNSs allow people to design a public profile, more or less visible according to the design of the Web site and the user's discretion, to create a list of other users with whom they interact, and then to view the lists of contacts made by other users within the system. On a social network service, every user can describe themselves, entering information about their background (example, high school), demographics (example, sex, age group), and cultural preferences (example, favorite books, films, TV programs); users can choose pictures and also add them to their profiles, creating a self-description. The proposed work uses data mining techniques to detect SNMDs [9, 10].
Text mining is employed to analyze the text and classify it based on abnormal keywords stored in the database. We propose an approach, new to the practice of SNMD detection, that analyzes the data logs of online social network users to
proactively identify severe social network mental disorder cases early. A machine learning framework is used for the detection of SNMDs. The proposed model is built with a support vector machine (SVM), which has been widely used to analyze online social networks in several areas.
5 Methodology
5.1 OSN Framework Construction
A social network denotes the interaction among people as they create, transfer, or share information and ideas in virtual communities and networks. In this module, whether for a product, an individual, or a corporation, there are three processes: OSN users, data analysis, and disorder prediction.
5.2 Data Collection
Data sets are collected from social networks. Two broad approaches to data collection are identified: (1) collecting data directly from the participants, with their consent, using surveys and electronic data collection instruments, and (2) aggregating data extracted from public posts (Fig. 3).
5.3 Pre-Processing
The collected data are typically pre-processed by (1) removing irrelevant samples and (2) cleaning and preparing the data for analysis. Pre-processing is employed to remove incomplete records, which are typically discarded in order to improve the accuracy of the prediction and classification results. Each post was pre-processed by eliminating stop words and superfluous information (e.g., retweets, hashtags, uniform resource locators), lowercasing characters, and segmenting sentences (Fig. 4).
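A minimal sketch of this cleaning step, assuming plain-text posts and a small illustrative stop-word list (a real system would use a fuller list and a proper sentence segmenter):

```python
import re

STOP_WORDS = {"a", "an", "the", "is", "am", "are", "i", "to", "of"}  # tiny illustrative subset

def preprocess(post: str) -> list[str]:
    """Lowercase, strip retweet markers, URLs, hashtags and mentions,
    then drop stop words, as described above."""
    text = post.lower()
    text = re.sub(r"\brt\b", " ", text)            # retweet marker
    text = re.sub(r"https?://\S+", " ", text)      # URLs
    text = re.sub(r"[#@]\w+", " ", text)           # hashtags and mentions
    tokens = re.findall(r"[a-z']+", text)          # crude tokenization
    return [t for t in tokens if t not in STOP_WORDS]

print(preprocess("RT @user I am so tired of everything... #lonely https://t.co/xyz"))
# ['so', 'tired', 'everything']
```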
5.4 Feature Extraction
In text data mining, sentiment analysis is a popular tool for understanding the expression of feeling. It is used to classify the polarity of a given text into classes such as positive, negative, and neutral. Feature selection isolates a relevant
Fig. 3 Showing the OSN framework constructions
Fig. 4 Showing the framework of the model
Fig. 5 Showing the pre-processing of the model
set of features that are able to identify symptoms of mental instability or to correctly label the participants, while avoiding overfitting. Statistical analysis is usually done to find the group of attributes that can discriminate between users with mental instability and users without mental instability (Fig. 5).
Prediction Model
Prediction models were used to detect and classify users according to mental instability and satisfaction with life. To build a predictive model, the selected list of features is employed as training data for machine learning algorithms to learn patterns from that data [11]. The proposed model was built with a support vector machine (SVM), which has been widely used to analyze OSNs in several areas.
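The chapter does not list its exact feature set, so the following is only a hedged sketch of the binary SVM step, assuming scikit-learn and purely illustrative per-user features (abnormal-keyword count, posts per day, negative-sentiment ratio, night-activity ratio):

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Illustrative features per user: [abnormal-keyword count, posts per day,
# negative-sentiment ratio, night-activity ratio]; label 1 = potential SNMD case.
X = np.array([[12, 30, 0.8, 0.7], [1, 5, 0.1, 0.2], [9, 25, 0.6, 0.9],
              [0, 8, 0.2, 0.1], [15, 40, 0.9, 0.8], [2, 6, 0.3, 0.2]])
y = np.array([1, 0, 1, 0, 1, 0])

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.33, random_state=0)

# Scale the features, then fit a binary SVM with an RBF kernel.
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0))
clf.fit(X_tr, y_tr)
print("predicted:", clf.predict(X_te), "actual:", y_te)
```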
Result
See Fig. 6.
6 Conclusion
The input stage gathers social network data from user feeds on the OSN framework. The data were cleaned and pre-processed to ensure that they
Fig. 6 Showing the performance of the proposed model
are in the form required by the analysis algorithms. Then, the main features related to the field of analysis are prepared for model building. Overall, this includes feature extraction and feature selection, producing the list of features to use in understanding and validating the predictive models. The proposed design predicts whether mental stress is present or not. Feature selection separates the relevant set of features, able to predict causes of mounting stress, that differ between users with mental disabilities and users without mental disabilities.
7 Future Work
In the future, we will present a framework for detecting users' mental stress states from their weekly social Web site data, leveraging tweet contents as well as users' social interactions; using real-world social media data as the basis, we will study the correlation between users' mental stress states and their social interaction activities.
References
1. Bhattacharyya D, Kumari NMJ, Joshua ESN, Rao NT (2020) Advanced empirical studies on
group governance of the novel corona virus, mers, sars and ebola: a systematic study. Int J
Current Res Rev 12(18):35–41. https://doi.org/10.31782/IJCRR.2020.121828
2. Doppala BP, NagaMallik Raj S, Stephen Neal Joshua E, Thirupathi Rao N (2021) Automatic
determination of harassment in social network using machine learning. https://doi.org/10.1007/
978-981-16-1773-7_20, Retrieved from www.scopus.com
3. Eali SNJ, Bhattacharyya D, Nakka TR, Hong S (2022) A novel approach in bio-medical image
segmentation for analyzing brain cancer images with U-NET semantic segmentation and TPLD
models using SVM. Traitement Du Signal 39(2):419–430. https://doi.org/10.18280/ts.390203
4. Eali SNJ, Rao NT, Swathi K, Satyanarayana KV, Bhattacharyya D, Kim T (2018) Simulated
studies on the performance of intelligent transportation system using vehicular networks. Int J
Grid Distribut Comput 11(4):27–36. https://doi.org/10.14257/ijgdc.2018.11.4.03
5. Joshua ESN, Battacharyya D, Doppala BP, Chakkravarthy M (2022) Extensive statistical anal-
ysis on novel coronavirus: towards worldwide health using apache spark. https://doi.org/10.
1007/978-3-030-72752-9_8. Retrieved from www.scopus.com
6. Joshua ESN, Bhattacharyya D, Chakkravarthy M (2021) Lung nodule semantic segmentation
with bi-direction features using U-INET. J Med Pharm Allied Sci 10(5):3494–3499. https://
doi.org/10.22270/jmpas.V10I5.1454
7. Joshua ESN, Bhattacharyya D, Chakkravarthy M, Kim H (2021) Lung cancer classification
using squeeze and excitation convolutional neural networks with grad cam++ class activation
function. Traitement Du Signal 38(4):1103–1112. https://doi.org/10.18280/ts.380421
8. Joshua ESN, Chakkravarthy M, Bhattacharyya D (2021) Lung cancer detection using improvised grad-cam++ with 3D CNN class activation. https://doi.org/10.1007/978-981-16-1773-7_5, Retrieved from www.scopus.com
9. Bhattacharyya D, Doppala BP, Thirupathi Rao N (2020) Prediction and forecasting of persistent
kidney problems using machine learning algorithms. Int J Current Res Rev 12(20):134–139.
https://doi.org/10.31782/IJCRR.2020.122031
10. Neal Joshua ES, Bhattacharyya D, Chakkravarthy M, Byun Y (2021) 3D CNN with visual
insights for early detection of lung cancer using gradient-weighted class activation. J Healthcare
Eng. https://doi.org/10.1155/2021/6695518
11. Neal Joshua ES, Chakkravarthy M, Bhattacharyya D (2020) An extensive review on lung
cancer detection using machine learning techniques: a systematic study. Revue d’Intelligence
Artificielle 34(3):351–359. https://doi.org/10.18280/ria.340314
An Enhanced K-Means Clustering
Algorithm to Improve the Accuracy
of Clustering Using Centroid
Identification Based on Compactness
Factor
Eali Stephen Neal Joshua, K. Asish Vardhan, N. Thirupathi Rao,
and Debnath Bhattacharyya
Abstract Researchers find it difficult to extract information from a large data set through a standard function; standard functions are insufficient to extract the needed information. We consider the k-means algorithm in the situation where the data are too enormous to be stored in main memory and must be retrieved sequentially, such as from a disk, and where as little memory as possible must be used. k-means clustering also converges very quickly when it is employed to obtain data from huge data collections. On the other hand, k-means has some disadvantages too, including the expensive computation caused by cluster centers that are selected randomly at the start. This influences two factors: the performance of the algorithm and the initialization of the number of clusters. In this paper, an improved k-means algorithm based on a data-clash strainer mechanism is given. The data-clash strainer mechanism is implemented through a regional centroid component (RCC) function, which is added to the standard k-means algorithm. This density-based recognition mechanism is built on the properties of clash data. The clustering result is effectively enhanced by discarding the clash data prior to the process of data clustering. Hence, the improved algorithm offers greater accuracy when compared to other existing clustering algorithms.
Keywords Cluster · Data clash strainer · K-means · RCC
E. S. N. Joshua (B) · N. T. Rao
Department of Computer Science and Engineering, Vignan’s Institute of Information Technology,
Visakhapatnam, AP 530016, India
e-mail: stephen.eali@gmail.com
K. A. Vardhan
Department of Computer Science and Engineering, Bullayya College of Engineering for Women,
Visakhapatnam, AP, India
D. Bhattacharyya
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
Vaddeswaram, Guntur, AP 522302, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_26
1 Introduction
A vast amount of data is handled in various fields, and such big data is processed using data mining techniques to retrieve information. "We are living in the information age" is a popular saying; however, it is more like actually living in the data age. Terabytes or petabytes of data pour into our computer networks, the World Wide Web, and various data storage devices every day from businesses, and this data needs to be mined in a useful manner to infer knowledge from it [1]. The technique of data mining involves cluster analysis, which is one of the main focuses of present-day researchers.
Clustering is a fundamental method for understanding and interpreting data that seeks to partition input objects into groups, known as clusters, such that objects within a cluster are similar to each other, and objects in different clusters [2] are not. A clustering method called k-means is simple, intuitive, and widely used in practice. Given a set of points S in a Euclidean space and a parameter k, the objective of k-means is to partition S into k clusters in a way that minimizes the sum of the squared distances from each point to its cluster center [3]. This circumstance has led to the formation of a wide range of clustering algorithms such as COBWEB, DBSCAN, CURE, and k-means [4].
This work introduces a method that avoids the arbitrary selection of initial centroids and involves detecting and eliminating identified far-apart data from the clusters. It thereby improves the performance of the classic k-means clustering mechanism in terms of accuracy and reduced complexity.
The rest of the paper is organized as follows: the section "Related Works" gives a brief discussion of related works. In the section "Centroid Recognition Based on Compactness", the basic nature of the k-means clustering procedure is studied, and the proposed methodology based on the compactness-based centroid detection technique is presented. The section "Results and Discussion" focuses on the comparison of the proposed methodology with other existing clustering algorithms and gives the experimental results. Finally, the section "Conclusion" concludes the current work.
2 Related Works
Shorab et al. [5] present an empirical method to select appropriate centroids at the initial stage of the k-means clustering strategy; it thereby tries to improve the algorithm in terms of its clustering accuracy and also focuses on the execution time. Experimental results showed the better adeptness of the improved k-means clustering algorithm over the traditional k-means algorithm, but its complexity increases as the size of the data set increases.
Cosmin et al. [6] show customer segmentation done with data mining to discover the customer characteristics hidden inside the data. Clustering analysis is the way to find the customer segments of a company. Clustering is the process
of forming segments of a set of data by measuring the similarities between data items.
Patel and Prateek [7] explore different kinds of problems using data mining clustering mechanisms and the relationships between them. The k-means clustering algorithm and a hierarchical algorithm are discussed in their paper. The performance of these algorithms in the clustering process is compared, and propositions are given about the suitability of such algorithms in different kinds of settings for different data sets.
Syakur et al. [8] say that the segmentation process puts customers in line with the
characteristics of similar customer groups. Customer segmentation is a preparatory
step to classify each customer according to a defined customer group. Customer
segmentation based on market research and demography requires understanding the
characteristics of all customers to be more effective.
Hong et al. [4] proposed an improved k-means algorithm resulting from a clustering reliability analysis; the proposed algorithm shows stability and achieves better results when the density is uneven and there exist large differences in the data clusters. Experimental results showed the ability of the improved k-means algorithm to handle non-uniform data sets.
Proposed Methodology
The k-means methodology was derived as a result of cluster analysis based on a partitioning strategy [9]. This methodology requires the arbitrary selection of "k" cluster centroids at the start. It also involves computing the distance between each selected centroid and each instance of the organized data collection to find the nearest centroid, and then amending the average distance of the centroids. This process is repeated until the criterion of the objective function is met.
The mean squared deviation criterion for clustering is calculated as

E = \sum_{i=1}^{k} \sum_{j=1}^{n_i} \| l_{ij} - s_i \|^2,

where l_{ij} is the j-th instance of class i and s_i is the centroid of class i. This methodology is illustrated in Fig. 1. The steps of the k-means clustering algorithm are given as follows; it involves the arbitrary selection of centroids, detection of the data center point, distance calculation, and the forming of clusters.
Input: P instances to be clustered {a1, a2, ……, an} and k (the number of initial centroids).
Output: k centroids and the disagreement volume between each instance and its nearest centroid neighbor.
The complexity of this k-means mechanism is determined by the following factors: the arbitrarily selected number of clusters k, the number of repetitions of the procedure, and the number of organized data instances [10].
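For reference, the following is a minimal NumPy sketch of the plain k-means loop just described (arbitrary initial centroids, nearest-centroid assignment, centroid update, repeat); it is a generic illustration rather than the authors' exact pseudocode:

```python
import numpy as np

def kmeans(points, k, iters=100, seed=0):
    """Plain k-means: arbitrary initial centroids, assign every instance to its
    nearest centroid, recompute the centroids, and repeat until they stabilize.
    Assumes no cluster goes empty (fine for this small demo)."""
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(iters):
        # distances of every instance to every centroid, shape (n, k)
        dist = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        new = np.array([points[labels == i].mean(axis=0) for i in range(k)])
        if np.allclose(new, centroids):
            break
        centroids = new
    sse = ((points - centroids[labels]) ** 2).sum()   # the criterion E above
    return centroids, labels, sse

rng = np.random.default_rng(1)
pts = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(5, 1, (50, 2))])
print("E =", kmeans(pts, k=2)[2])
```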
3 Centroid Recognition Based on Compactness
The performance of the k-means technique depends on the centroids selected at the start, which greatly affect the final result. Outliers lying away from the data-compact region cause the newly computed centroid to deviate further
Fig. 1 Showing the performance of the proposed classifiers
from the data-compact region; this directly influences the final clustering result, which then deviates greatly from the actual one. To avoid such outliers and to enhance the result, it is better to discard the isolated data from the collection prior to the process of data clustering. The deviation level of each instance in the organized data is determined using the regional centroid component (RCC), which involves computing the distance of each instance from its nearest centroid neighbors, once the process of producing the k centroids and the k shortest distances of each instance from its nearest centroid neighbors is complete. Finally, RCC detects the regional centroid according to the regional centroid component of each instance [13]. The regional centroid component (RCC) detection is illustrated in the following steps, where SDA(i) is the regional compactness of the i-th of the k nearest centroids of d, and SDA(d) is the regional compactness of d. RCC(d) expresses the suitability of d as a centroid. RCC has a value of about one in a compactly distributed data collection. The centroid component by which a centroid is differentiated is greater than the others, because the regional compactness of the centroids in the data collection is much less than the regional compactness of their nearest-neighbor instances.
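The chapter does not reproduce the RCC formula itself, so the sketch below is only one plausible reading of the steps above: take SDA(d) as the mean distance from d to its k nearest neighbours, define RCC(d) as the ratio of SDA(d) to the mean SDA of those neighbours (so RCC is about one inside compact regions and grows for isolated clash data), and strain out instances whose RCC exceeds a threshold before clustering. All names and the threshold value are assumptions.

```python
import numpy as np

def rcc_filter(points, k=5, threshold=1.5):
    """Strain out clash (far-apart) data before clustering. SDA(d) is taken as
    the mean distance from d to its k nearest neighbours; RCC(d) is the ratio
    of SDA(d) to the mean SDA of those neighbours. Hypothetical reconstruction,
    not the authors' exact formula."""
    dist = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=2)
    np.fill_diagonal(dist, np.inf)                       # ignore self-distance
    nn = np.argsort(dist, axis=1)[:, :k]                 # k nearest neighbours
    sda = np.take_along_axis(dist, nn, axis=1).mean(axis=1)
    rcc = sda / sda[nn].mean(axis=1)
    return points[rcc < threshold]                       # keep the non-clash data

rng = np.random.default_rng(0)
data = np.vstack([rng.normal(0, 1, (100, 2)), [[9.0, 9.0]]])  # one isolated point
print(len(data), "->", len(rcc_filter(data)))   # the outlier is strained out
```

Running the k-means sketch above on the filtered points then corresponds to the improved algorithm described in the next subsection.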
3.1 The Improved k-means Clustering Algorithm Using
Regional Centroid Component (RCC)
The mechanism begins with the elimination of far-apart data by employing the above-described RCC-based recognition strategy. This ensures that the
computation of the initial centroids is free from far-apart data instances, removing them from the determination of the centroids. The improved k-means algorithm is executed on the newly filtered data organization employing RCC, as illustrated in the following steps.
Input: P instances to be clustered {a1, a2, ……, an} and k (the number of initial centroids).
Output: k centroids and the disagreement volume between each instance and its nearest centroid neighbor.
4 Results and Discussion
The performance of the proposed methodology is evaluated in terms of its suitability and accuracy by comparing it with other existing clustering algorithms, such as mean-shift clustering, density-based spatial clustering of applications with noise (DBSCAN), expectation–maximization (EM) clustering using Gaussian mixture models (GMMs), and agglomerative hierarchical clustering. The data sets Abalone, Wine, and Iris from the UCI repository, one of the most popular machine learning databases, have been taken for our experiment. Table 1 gives these details in brief. The proposed work produces better outcomes and offers an optimal solution without compromising the caliber of the clustering. The experimental results prove the effectiveness of this improved version of the k-means algorithm over all the other clustering strategies.
The accuracy level of the proposed work is reported alongside the mechanism; it achieved greater accuracy due to the introduction of the component strategy into the traditional k-means algorithm.
From Fig. 5, it can be clearly noted that the proposed RCC-based improved k-means provides an approximately 10% higher level of accuracy when compared to other existing clustering algorithms. Furthermore, our proposed algorithm exhibits stability and parallelization efficiency and proves its greater usability.
A clustering-time comparison is also made with all the other clustering strategies. Although the proposed algorithm's clustering time is greater than the others', the difference from the time taken by the other algorithms is not considerable, and the proposed method's time consumption closely resembles the others'.
Table 1 Selected data sets

Data set | No. of objects | No. of properties | No. of groups
Abalone | 15,100 | 11 | 36
Iris | 1400 | 24 | 6
Wine | 3462 | 42 | 7
5 Conclusion
The arbitrary selection of "k" initial centroids in the classical k-means algorithm makes it unstable and increases the complexity of the algorithm; hence, its overall performance is reduced in terms of accuracy. This problem has been overcome by introducing a novel compactness-based regional centroid component (RCC) recognition function into the k-means clustering technique. As illustrated by the experimental results, it achieves better accuracy than all of the other existing clustering algorithms in almost the same amount of time as taken by the other algorithms.
References
1. Satyanarayana KV, Rao NT, Bhattacharyya D, Hu Y (2022) Identifying the presence of
bacteria on digital images by using asymmetric distribution with k-means clustering algorithm.
Multidimension Syst Signal Process 33(2):301–326. https://doi.org/10.1007/s11045-021-008
00-0
2. Birant D, Kut A (2007) ST-DBSCAN: an algorithm for clustering spatial-temporal data. Data
Knowl Eng 60(1):208–221
3. Miller KG, Lee RP, Tableman A et al (2021) Dynamic load balancing with enhanced shared-
memory parallelism for particle-in-cell codes. Comput Phys Commun 259, Article ID 107633
4. Bhattacharyya D, Doppala BP, Thirupathi Rao N (2020) Prediction and forecasting of persistent
kidney problems using machine learning algorithms. Int J Current Res Rev 12(20):134–139.
https://doi.org/10.31782/IJCRR.2020.122031
5. Zhou Q, Guo S, Lu H (2020) Falcon: addressing stragglers in heterogeneous parameter server
via multiple parallelism. IEEE Trans Comput 70(1):139–155
6. Shibao S, Keyun Q (2007) Research on modified k-means data cluster algorithm. Comput Eng 33(13):200–201
7. Vardhan KA, Rao NT, Raj SNM, Sudeepthi G, Divya, Bhattacharyya D, Kim T (2019) Health
advisory system using IoT technology. Int J Recent Technol Eng 7(6):183–187. Retrieved from
www.scopus.com
8. Fred ALN, Leitão JMN (2000) Partitional vs hierarchical clustering using a minimum grammar complexity approach. In: Proceedings of SSPR and SPR 2000, LNCS 1876, pp 193–202. [online] Available: http://www.sigmod.org/dblp/db/conf/sspr/sspr2000.htm
9. Fahim AM, Salem AM, Torkey FA (2006) An efficient enhanced k-means clustering algorithm.
J Zhejiang Univ Sci A 10:1626–1633
10. Yuan F, Meng ZH, Zhang HX, Dong CR (2004) A new algorithm to get the initial centroids.
Proceedings of the 3rd international conference on machine learning and cybernetics, pp 26–29
Prediction of Chronic Kidney Disease
with Various Machine Learning
Techniques: A Comparative Study
K. Swathi and G. Vamsi Krishna
Abstract Chronic kidney disease is one of the most serious health care issues faced by people across the globe. It mainly results in kidney failure, sometimes leads to cardiovascular disease, and sometimes leads to the death of the patient. So, detection of this disease in its early stages plays a significant role in treating and controlling it. In this paper, various machine learning algorithms are demonstrated that disclose and extract hidden information from clinical and laboratory patient data, which can aid clinicians in maximizing accuracy for illness severity stage assessment. Several machine learning algorithms, such as KNN, RF, AdaBoost, gradient boost, and a voting classifier, were considered, and a comparative study was done. These comparisons were made by taking the CKD dataset available in the UCI repository. The models employed for the study provide high accuracy, greater than prior research, suggesting that they are more trustworthy than the previous models.
Keywords Classification · Machine learning · Chronic kidney disease
1 Introduction
Chronic kidney disease (CKD) is a long-lasting disease of the kidney that may further lead to end-stage renal failure, in which the kidney stops functioning entirely and is no longer able to remove waste, excess water, or chemicals from the body, causing imbalance [1]. Renal failure can expose the patient to cardiac arrest and various arterial failures and can lead to death. CKD affects people worldwide, with prevalence ranging between 7 and 15%. Globally, in 2007, around 1.21
K. Swathi (B)
Department of Computer Science and Engineering, Vignan’s Institute of Information Technology
(A), Visakhapatnam, AP, India
e-mail: swathi.kalam@gmail.com
G. Vamsi Krishna
Department of Computer Science and Engineering, Dr. Lankapalli Bullayya College of
Engineering, Visakhapatnam, AP, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_27
million people died because of CKD, and the mortality rate increased in the following years [2].
Early detection of kidney damage can help with treatment, although this is not always possible. To avoid serious injury, we need to understand a few renal illness indicators better. The major motive of this study is to anticipate kidney disease by deeply analyzing data from the observed indices, using various machine learning classification algorithms to predict the illness, and selecting the technique that gives the best accuracy estimate [3].
The main objective of this work is to undertake a comparative examination of multiple machine learning algorithms to predict renal illness. The accuracy percentage for most prior studies was above 90%, which was regarded as excellent. This paper is unique because it employs a variety of algorithms and achieves an above-97% accuracy rate, which is greater than in the previous studies.
2 Literature Review
Many researchers are working on the prediction of CKD using a variety of classification algorithms. They evaluated the algorithms random forest and the back-propagation approach and found that back propagation, a supervised learning method for feedforward neural networks, produces the best results [4]. Finally, the random forest implementation method was selected for the system [5]. W.H.S.D. Gunarathne et al. [6] made a comparison to find the best solution among different machine learning models. Out of all the algorithms, they found that the multi-class decision forest technique performs best compared with earlier techniques, with a higher accuracy rate of 99%. However, this algorithm works well only when a smaller dataset is taken, and they used only 14 attributes. Ramya et al. [4] employed multiple machine learning classification algorithms to reduce the diagnosis time and to upgrade the diagnostic accuracy rate. Reddy et al. [7] tested 12 different classification algorithms on a CKD dataset of 400 records and 24 attributes. They assessed the accuracy of the prediction findings by comparing computed results to actual results. As evaluation criteria, accuracy (which tells how well a model works), sensitivity (which tells how well a model can identify true positive instances), precision, and specificity were used. The decision tree approach achieved an accuracy of up to 98.6%, a sensitivity of 0.9720, a precision of one, and a specificity of one. Arif-Ul-Islam et al. suggested a method that uses boosting classifiers and the J48 decision tree to forecast sickness. Their work aims to identify CKD by examining the performance of boosting algorithms. They also derive rules that illustrate correlations between CKD features. A model's accuracy is based on the prediction of outputs and is affected by missing values in the dataset. They found a solution to this problem by recalculating CKD stages, which resulted in uncertain results. To fill the gaps, they imputed missing data and used a machine learning method to detect CKD. They obtained their data from a 400-record dataset with 25 factors that indicate
whether or not a patient has CKD. They use K-nearest neighbors, neural networks, and random forest to arrive at their conclusions.

Fig. 1 Block diagram: dataset → data pre-processing → feature selection → model train → model test, with gradient boosting and the AdaBoost algorithm among the classifiers
3 Materials and Method
This section provides the block diagram, flow diagrams, and evaluation metrics, along with the study's approach and methodology and a description of the dataset.
The suggested system is represented by the block diagram in Fig. 1. The framework uses the CKD prediction dataset. Gradient boosting, KNN, the AdaBoost algorithm, and random forest have all been employed after pre-processing and feature selection. In the next subsections, we go over each of the diagram's components in detail.
Dataset: The CKD dataset was used for this study. This data collection consists
of 400 rows and 20 columns. A value of “1” or “0” appears in the output column
“class.”
Feature Selection: This is the process of selecting only the necessary features that are needed for model training.
4 Results and Discussion
1. Gradient Boosting: This is a kind of ensemble technique and one of the most powerful algorithms for dealing with tabular data. It can even learn from complex
problems. Usually, complex problems involve non-linearity, which can be modeled using non-linear activation functions like ReLU, sigmoid, etc. This algorithm also helps in dealing with missing values. Better performance can be achieved by combining multiple weak models. It can have one or more loss functions, as characterized by the gradient, and it reduces the loss function by iterating over the data points again and again. It works by improving a loss function, with each stage fitted as a weak learner. It performs randomized sampling of the data. It can reduce overfitting, so model performance can be increased. It uses sequential classifiers because it is a boosting technique. When gradient boosting is applied to the dataset, the accuracy achieved is 97.8%, shown with the help of the ROC curve (a hedged sketch of the three classifiers presented in this section follows the list).
2. AdaBoost Algorithm: Adaptive boosting, also known as AdaBoost, fits additional copies of a base classifier successively on the same dataset. Decision stumps are used as weak learners. Decision stumps are simply trees that have just a single split. More weight is given to hard-to-classify instances, whereas less weight is given to easy-to-classify observations. A weighted average of the outputs from each of the individual learners gives the final result. Using this AdaBoost algorithm, the accuracy obtained on the dataset is 95.6%, with the ROC curve as follows (Figs. 2 and 3):
3. K-nearest Neighbor
K-nearest neighbor is a non-parametric machine learning algorithm based on the nearest-neighbor computation of the supervised learning strategy; it uses proximity to determine classes, on the premise that similar objects group together. KNN can be used for classification as well as regression problems. For classification, assignment among the different classes is based on a majority of
Fig. 2 ROC curve for test data using gradient boosting algorithm
Fig. 3 ROC curve for test data using AdaBoost algorithm
votes, while regression is based on calculating the Euclidean distance, which is used to find the distance between the two nearest data points; objects at the nearest distance are clustered together. The KNN algorithm is regarded as lazy because it only stores the training data, and whenever a classification is made, it creates overhead on the memory pool where the training data resides. While storing the data in memory, it does not perform any calculations. It tries to find the points that determine to which cluster a particular data point belongs, and it is a simple way to classify data. When newly discovered data come into account, the classifier assigns those data to the cluster where they should belong. On the kidney dataset, the KNN algorithm results in 91.3% accuracy, and the ROC curve is shown in Fig. 4.
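The following is a hedged, self-contained sketch of the three classifiers above, run on a synthetic stand-in for the 400-row CKD table; the real study would load the UCI CKD data after the pre-processing and feature selection described earlier, and all parameter values here are illustrative assumptions:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, GradientBoostingClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the 400-row, 20-column CKD table with a binary "class" label.
X, y = make_classification(n_samples=400, n_features=20, random_state=1)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=1)

models = {
    "gradient boosting": GradientBoostingClassifier(n_estimators=200, random_state=1),
    # AdaBoost with single-split decision stumps as the weak learners
    # (the keyword is base_estimator= on scikit-learn versions before 1.2).
    "AdaBoost": AdaBoostClassifier(estimator=DecisionTreeClassifier(max_depth=1),
                                   n_estimators=100, random_state=1),
    # KNN: scaled features, Euclidean distance, majority vote among 5 neighbours.
    "KNN": make_pipeline(StandardScaler(),
                         KNeighborsClassifier(n_neighbors=5, metric="euclidean")),
}

for name, model in models.items():
    model.fit(X_tr, y_tr)
    scores = model.predict_proba(X_te)[:, 1]   # inputs to each ROC curve
    print(f"{name}: accuracy={model.score(X_te, y_te):.3f}, "
          f"ROC AUC={roc_auc_score(y_te, scores):.3f}")
```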
5 Conclusion
This research aims to observe and examine the outcomes obtained by employing various AI computations to predict chronic kidney failure in the clinical setting. This paper presented a prediction computation to detect CKD in its early stages. The dataset includes input parameters gathered from CKD patients, and the models are trained and validated on the specified data parameters. Gradient boosting, AdaBoost, KNN, and random forest learning models are used to classify CKD. The models' performance is evaluated in terms of prediction accuracy. The assessment findings revealed that the gradient boosting model predicts CKD better than AdaBoost, random forests, and KNN. A comparison based on execution time would also be conceivable and is left as an extension of this investigation. A combination of classifiers is also implemented as part of voting
Fig. 4 ROC curve for test data using K-nearest neighbor algorithm
classifiers, where voting classifier 1 uses RF, KNN, and gradient boost, resulting in the highest accuracy, equal to that of the gradient boost algorithm, whereas voting classifier 2 uses RF and KNN, resulting in a lower accuracy of 95.6% when compared to voting classifier 1 (GB versus KNN versus RF).
References
1. Hooi LS, Ong LM, Ahmad G, Bavanandan S, Ahmad NA, Naidu BM et al (2013) A population-
based study measuring the prevalence of chronic kidney disease among adults in West Malaysia.
Kidney Int 84(5):1034–1040. pmid:23760287
2. Chandra Sekhar P, Thirupathi Rao N, Bhattacharyya D, Kim T (2021) Segmentation of natural images with k-means and hierarchical algorithm based on mixture of Pearson distributions. J Sci Ind Res 80(8):707–715. Retrieved from www.scopus.com
3. Gao S, Manns BJ, Culleton BF, Tonelli M, Quan H, Crowshoe L et al (2007) Prevalence of
chronic kidney disease and survival among aboriginal people. J Am Soc Nephrol 18(11):2953–9.
pmid:17942955
4. Eali SNJ, Bhattacharyya D, Nakka TR, Hong S (2022) A novel approach in bio-medical image
segmentation for analyzing brain cancer images with U-NET semantic segmentation and TPLD
models using SVM. Traitement Du Signal 39(2):419–430. https://doi.org/10.18280/ts.390203
5. Bhattacharyya D, Doppala BP, Thirupathi Rao N (2020) Prediction and forecasting of persistent
kidney problems using machine learning algorithms. Int J Curr Res Rev 12(20):134–139. https://
doi.org/10.31782/IJCRR.2020.122031
6. Doppala BP, Raj SNM, Joshua ESN, Rao NT (2021) Automatic determination of harass-
ment in social network using machine learning. https://doi.org/10.1007/978-981-16-1773-7_20
Retrieved from www.scopus.com
7. Reddy DKK, Behera HS, Nayak J, Routray AR, Kumar PS, Ghosh U (2022) A fog-based
intelligent secured IoMT framework for early diabetes prediction, 199–218
Blockchain and Its Idiosyncratic Effects
on Energy Consumption
and Conservation
K. Mrudula Devi, D. Surya Sai, N. Thirupathi Rao, K. Swathi, and Swathi Voddi
Abstract Blockchain technology has affected almost every industry and business model, and the vast energy sector is no exception. Energy businesses around the world have already started exploring the use of blockchain technology in various applications. The applications range from P2P energy trading and asset management to demand and supply chain tracking. Moreover, given precise reliability and security requirements, the application range in the energy sector is relatively narrow. This article aims to define and show the environment and methodology for applying blockchain principles to solve operational technology challenges at energy utilities, and to provide a much closer analysis of all the possible integrations between the energy sector and blockchain toward an efficient, conservative, and self-sufficient energy industry. Moreover, this paper analyzes and reviews two innovative application-specific use cases of blockchain in the energy sector and one highly competent Solidity code, which could revolutionize the energy sector.
Keywords Blockchain · Solidity · Smart contracts · Energy certificates
1 Introduction
Blockchain is a distributed ledger technology that is managed by peers on a peer-to-peer network. It is an accounting system in which a network of computers is used
K. M. Devi (B)
Department of Mathematics, Vignan’s Institute of Information Technology (A), Visakhapatnam,
AP, India
e-mail: mruduladevisai@gmail.com
D. S. Sai · N. T. Rao · K. Swathi
Department of Computer Science and Engineering, Vignan’s Institute of Information Technology
(A), Visakhapatnam, AP, India
S. Voddi
Department of Computer Science and Engineering, Prasad V. Potluri Siddhartha Institute of
Technology, Vijayawada, Andhra Pradesh, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_28
Fig. 1 Blockchain representation [2]
to distribute the ledger. Hence, fundamentally, blockchain technology is a record-keeping tool [1]. There is no hierarchy in a blockchain; the network is managed without one. Decentralization is the main virtue of this technology, so there are no centralized storages or administrators. A blockchain can be understood as a series of blocks that can grow indefinitely and that are connected via cryptographic hashing. The blocks can also be considered records; each stores its timestamp (creation time), transaction data, and the hash of the preceding block. A blockchain is designed to be resistant to any alterations made to the data. Its main agenda as a distributed ledger is its effective way of holding the transactions between parties, which are immutable in nature [2]. Alongside immutable records, numerous blockchains include discrete pioneering mechanisms for agents that sustain the integrity of the network infrastructure, mainly its data, thereby ensuring an adequately decentralized consensus. Three objectives contend in a blockchain: being fast, low-cost, and decentralized, as described in Fig. 1. These three factors can give rise to a competent technology with multi-fold use cases and many beneficial products.
A peer-to-peer network generally obeys a set of instructions or protocols for adding new nodes into the chain, validating them, and governing their communication. All these factors make it a distributed ledger, since control is decentralized. The moment a block is added into the chain, modification of its data, such as alteration or removal, is practically impossible, as shown in Fig. 1. To modify the data, the majority of the network is required to give its consent, and here the consensus algorithm comes into play. Satoshi Nakamoto, generally regarded as the father of blockchain, brought the technology into existence on October 31st, 2008 [3].
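A toy sketch of this chaining: each block stores a timestamp, transaction data, and the hash of the preceding block, and its own hash covers all three, so tampering with any earlier block invalidates every hash after it. This is an illustration of the concept, not a production blockchain.

```python
import hashlib
import json
import time

def make_block(transactions, prev_hash):
    """A block records its creation time, its transaction data, and the hash
    of the preceding block; its own hash is computed over all three fields."""
    block = {"timestamp": time.time(),
             "transactions": transactions,
             "prev_hash": prev_hash}
    payload = json.dumps(block, sort_keys=True).encode()
    block["hash"] = hashlib.sha256(payload).hexdigest()
    return block

genesis = make_block([], prev_hash="0" * 64)
b1 = make_block([{"from": "A", "to": "B", "kwh": 3}], genesis["hash"])
b2 = make_block([{"from": "B", "to": "C", "kwh": 1}], b1["hash"])
print(b2["prev_hash"] == b1["hash"])   # True: the blocks are chained
```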
1.1 Applications
Blockchain has a number of beneficial applications covering almost all sectors. Table 1 illustrates them in the most precise manner.
Table 1 Applications of blockchain

Industry | Use cases | Start-ups
Energy, utility, and mining | Smart utility metering systems; decentralized energy data platforms | Bankymoon, AutoGrid
Entertainment and media | Control of ownership rights of digital media; disintermediation of record labels | Ascribe, Mycelia
Financial services | International P2P transactions; anti-money laundering | Bitcoin, Confirm
Healthcare | Storage of healthcare records; population health and clinical studies | HealthNautica, Tierion
Insurance | Peer-to-peer flight insurance policies; micro-insurance | InsurETH, Stratumn
Freight transportation and logistics | Trade documentation; supply chain transparency | Wave, Provenance
Hospitality | Loyalty programs | Loyyal
2 Literature Review
This paper consists of the use cases tagged with deep research work on the case studies and on how they have affected the environment and energy systems in those areas. The paper's core objective is to provide a unique code that could help develop a smart contract to design the exchange procedure for energy within micro-grids. The smart contracts can also help us add nodes securely into the blockchain in an encrypted way. The consensus algorithm for validating new nodes into the chain is also coded.
Considering that blockchain is a recent technology and that its applications in energy conservation are idiosyncratic, there are very few existing systems that have worked toward those regions. The papers "Blockchain technology in the energy sector: a systematic review of challenges and opportunities", "Blockchain for energy utilities", and "How blockchain can be used for creating a market for energy savings certificates" have provided an insight into the applications of energy certificates and into how blockchain can help the energy sector [4]. The citations of the research papers given in the reference section give more detailed insight into blockchain's applications in trading, energy, and other products. They have been a driving force that enabled me to code my smart contract explicitly for energy trading and to do further detailed research on the case studies, mentioned mainly by the authors Gunther and Andoni.
Though there are a few existing works on blockchain in the energy sector, this paper presents detailed research on the case studies with reference to many local journals, articles, and news reports. The Brooklyn micro-grid, though mentioned in many
papers, has no papers that emphasize the amount of greener energy produced directly or indirectly due to the inclusion of blockchain technology in its micro-grids. The concept of energy certificates has been discussed in a reasonable number of research papers by Churong and in the article written by Stanford Business; here, we concentrate on the code that gave rise to it, and that case study can be tagged as a use case of an energy utility using blockchain technology. The process of issuing energy certificates is illustrated both as a flowchart and as code, for a better understanding of it.
The core of this paper is the smart contract for validating the nodes to be added into the chain. The nodes can be any of the stakeholders, such as prosumers or generators of the energy unit. Moreover, the smart contract also contains the code that makes the exchange of energy functional. The exchange can be made either via a currency medium, such as a cryptocurrency, or via a barter system with energy units as the exchange medium.
2.1 Study and Review of the Factors/activities Involved
in the Energy Industry
Defining the factors/activities involved in the energy industry: the use cases for energy
utilities via blockchain are vast. Understanding each use case/factor specified in
the energy sector is vital, since it gives us an idea of how to inculcate them as
blockchain applications in that field. They can be factorised via a few factors such as
access, participation, domain, and model. Table 1 explains all the data tagged with those
factors in the most comprehensible way.
Beyond application-specific factors, blockchain has applications in many
factors/use cases, mainly in the energy sector. However, there are certain wide-ranging
blockchain applications that could have a huge impression on the energy sector. This
paper identifies two such use cases, each tagged with an already successful
case study that shares some similarities with the use case described below.
Use case 1: Induction of cryptocurrencies into implicit energy trading within a
micro-grid: A micro-grid is a closed energy eco-system where the transfer of energy
is done internally. It is generally used to manage energy efficiency by providing a
secure supply and mainly providing backup power in vital scenarios when there are
power outages. In such scenarios, we can benefit the citizens who produce energy
locally by allowing them to sell it directly to the people who need it. Thus, by
removing any third-party interference, we can create a self-sufficient society that
could be termed "smart" due to the complete involvement of pure conventional energy
resources. The removal of third-party interference in such cases profits both sides,
i.e., the consumer and the producer [5]. Prices can be inferred directly from demand
and supply in the micro-grid, and blockchain secures the transactions of both sides,
i.e., the producer/supplier and the consumer.
Incorporating smart contracts in such use cases can have the following effects:
Smart contracts can be written in Solidity to ensure a sufficient balance of the demand–supply chain; moreover, actively balancing the chain improves energy efficiency.
Transactions are secured via hashes, which are implicitly encrypted; further, the immutable ledger can prove any tampering with the resources (see the sketch after this list).
Instead of traditional payment and confrontation, cryptocurrencies can serve as the medium of payment between the two parties. Given that they are legalised in many parts of the world, they are one of the safest options for transactions.
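To make the hash-and-ledger point concrete, here is a minimal Python sketch (an illustration, not the paper's Solidity code) showing how recomputing chained hashes exposes tampering with a recorded trade; all field names and values are hypothetical.

```python
import hashlib
import json

def block_hash(body: dict) -> str:
    """Deterministically hash a block's contents (illustrative only)."""
    return hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest()

def make_block(prev_hash: str, kwh: float, price: float) -> dict:
    """Create an energy-trade block linked to the previous block's hash."""
    block = {"prev_hash": prev_hash, "kwh": kwh, "price": price}
    block["hash"] = block_hash(block)
    return block

def chain_is_valid(chain: list) -> bool:
    """Recomputing the hashes exposes any tampering with earlier trades."""
    for i, block in enumerate(chain):
        body = {k: v for k, v in block.items() if k != "hash"}
        if block["hash"] != block_hash(body):
            return False
        if i > 0 and block["prev_hash"] != chain[i - 1]["hash"]:
            return False
    return True

genesis = make_block("0" * 64, kwh=0.0, price=0.0)
chain = [genesis, make_block(genesis["hash"], kwh=5.0, price=0.12)]
assert chain_is_valid(chain)
chain[1]["kwh"] = 50.0            # tamper with a recorded trade...
assert not chain_is_valid(chain)  # ...and validation fails
```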
Example Case Study: The Brooklyn Micro-grid: Brooklyn micro-grid is a
blockchain-based P2P energy trading platform. It is located in the Gowanus and Park
slope communities in Brooklyn, NYC. It is run by a partnership between LO3 Energy,
ConsenSys, Siemens, and Centrica [6]. It enables prosumers to sell their energy
surplus directly to their neighbours by using Ethereum-based smart contracts and PBFT
consensus, implemented by Tendermint. Micro-grids minimize the amount of energy
lost through transmission; as an estimated 5% of electricity created in the US is lost in
transit, micro-grids provide an efficient alternative. As already specified, the micro-
grid aims to provide electricity amidst a power outage. Hence, in this specific project,
the electricity is directed to the much-needed places like hospitals and shelters from
the houses during outages.
Previously, profits were out of reach for the residents even when they could
garner, harvest, and sell the surplus energy produced by their photovoltaic (PV)
panels; as an alternative, their bills were merely reduced. This is because they were
under the control of the utility company. It also points to the helplessness of the
citizens in the area: although they can generate their own power, they cannot supply
it during a shortage or blackout because their PVs were shut down. Considering the
issues mentioned above, the upgraded Brooklyn micro-grid vested authority over the
power generated with the respective residents, thereby abolishing the involvement of
any third-party companies. In 2015, US solar developers contributed 7.3 GW of
electricity to the grid, up from just 1 GW in 2010, and a quarter of this came from
rooftop PV. The global micro-grid market is expected to grow at an estimated 15%
annually from 2018 to 2022, reaching an expected 30 billion dollars by 2022.
Use case 2: Smart decentralized exchanging and contracting among energy
stake-holders in the global market: The above use case could be highly successful
at the regional level. Nevertheless, what if the energy needs to be exchanged among
large industries with varying motives or among different states? That is where this
use case comes into play. As already seen in the previous case study, solar (green)
energy has seen a spike, and producers have profited alongside consumers who enjoy
an uninterrupted supply of energy. The same can be done throughout the world;
however, we need an open and regulated system.
Moreover, the system should be immutable, automated, and transparent. Smart
contracts become handy in such situations. In this use case, the contracts are signed
by the energy producers with numerous other parties. The energy consumers ensure
the task of getting parties on board via contracting. Simultaneously, all the
key bodies or stake-holders, such as the suppliers, distributors, and regulators,
must approve the smart contracts and sign them. Blockchain nodes are owned
and administered by the key bodies, who are also entitled to enforce the signed terms
of the contracts. We introduce energy saving certificates alongside blockchain to
enable all the above requirements. The agenda of implementing and sculpting this
smart contract is to devise an efficient platform that enables the trading and
implementation of energy saving certificates, thereby removing all third-party
involvement [1, 4].
The function sell() is used to initialize the contract. It acts as a constructor and
declares the functions and variables implicitly. It also sets msg.sender as the owner.
The onlyOwner modifier ensures that the sell() function is executed only by the
smart contract owner, thereby authenticating the task of selling the surplus
(over-achieved) energy by the owner [6].
3 Proposed System: Application to Enable Energy Trading
[Ethereum Code]
The heart of the paper is concentrated here. The following pages contain brief
Solidity algorithms and pieces of code that could enable the application to support
energy trading globally; a hedged Python sketch of the same logic is given below.
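The Solidity listings themselves do not survive in this extract, so the following is a minimal Python sketch of the contract logic described above, assuming the sell()/onlyOwner design outlined in the previous section: the deployer becomes the owner, an onlyOwner-style guard restricts selling, and validated nodes hold energy balances. Every identifier is an illustrative stand-in rather than the authors' code.

```python
class EnergyTradeContract:
    """Illustrative Python model of the described Solidity contract."""

    def __init__(self, sender: str):
        # Mirrors the constructor-like sell() setup: the deployer becomes owner.
        self.owner = sender
        self.nodes = set()    # validated participants (prosumers/generators)
        self.balances = {}    # node address -> energy units held

    def _only_owner(self, sender: str):
        # Mirrors the onlyOwner modifier: restrict the call to the owner.
        if sender != self.owner:
            raise PermissionError("caller is not the contract owner")

    def add_node(self, sender: str, node: str):
        """Validate and register a new node (a stand-in for the consensus step)."""
        self._only_owner(sender)
        self.nodes.add(node)
        self.balances.setdefault(node, 0.0)

    def sell(self, sender: str, buyer: str, kwh: float):
        """Owner sells surplus ('over-achieved') energy to a validated buyer."""
        self._only_owner(sender)
        if buyer not in self.nodes:
            raise ValueError("buyer is not a validated node")
        self.balances[buyer] = self.balances.get(buyer, 0.0) + kwh

contract = EnergyTradeContract(sender="0xOwner")
contract.add_node("0xOwner", "0xProsumerA")
contract.sell("0xOwner", "0xProsumerA", kwh=3.5)
```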
4 Conclusion
This paper mainly aims to show the effects blockchain could have on the energy
sector and the ways in which it could revolutionize it. The paper presents two main
use cases that could be incorporated into the energy sector to enhance individual
profits, greener energy, and lower power consumption. The case studies tagged
alongside them take a similar approach, which can be enhanced and made global. Given
the crises of global warming, pollution, and scarcity of fossil fuels, the above
factors could extend our sustainability on this Earth much longer. The smart
contracts written can be used globally to help any individual or company create a
node, transfer energy, or use cryptocurrencies as a medium of commodity exchange.
Moreover, the smart contract written is transparent, immutable, and decentralized;
hence, it is a highly viable source of trust. It can finally be concluded that
blockchain can help build a greener and self-sufficient society by making the
above-mentioned changes in the energy sector.
References
1. Vasilevsky NA, Brush MH, Paddock H, Ponting L, Tripathy SJ, LaRocca GM et al (2013)
On the reproducibility of science: unique identification of research resources in the biomedical
literature. PeerJ 1:e148. https://doi.org/10.7717/peerj.148
2. Treiblmaier H (2019) Toward more rigorous blockchain research: recommendations for writing
blockchain case studies. Front. Blockchain 2:1–15. https://doi.org/10.3389/fbloc.2019.00003
3. Bhattacharyya D, Dinesh Reddy B, Kumari NMJ, Rao NT (2021) Comprehensive analysis on
comparison of machine learning and deep learning applications on cardiac arrest. J Med Pharm
Allied Sci 10(4):3125–3131. https://doi.org/10.22270/jmpas.V10I4.1395
4. Bhattacharyya D, Doppala BP, Thirupathi Rao N (2020) Prediction and forecasting of persistent
kidney problems using machine learning algorithms. Int J Curr Res Rev 12(20):134–139. https://
doi.org/10.31782/IJCRR.2020.122031
5. Scott B (2016) How can cryptocurrency and blockchain technology play a role in building
social and solidarity finance? In: UNRISD working paper, no. 2016-1. United Nations
Research Institute for Social Development (UNRISD), Geneva. Available online at:
https://www.econstor.eu/bitstream/10419/148750/1/861287290.pdf
6. Swanson T (2015) Consensus-as-a-service: a brief report on the emergence of permissioned,
distributed ledger systems. Technical report. Available online at: https://bit.ly/32q2bkt
Smart Hydroponics System for Soilless
Farming Based on Internet of Things
G. V. Danush Ranganath, R. Hari Sri Rameasvar, and A. Karthikeyan
Abstract Hydroponics agriculture is a soilless development strategy where the plant
is developed with the assistance of nutrient-rich supplements and water alone, and
consequently, it gives an answer for the developing shortage of horticultural land.
The point of this work is to plan and build an indoor programmed vertical hydroponic
framework that does not rely upon the external environment. The planned framework
is able to develop normal kinds of harvests that can be utilized as a food source inside
homes without the need for huge space. This process of cultivation is widely gaining
favor in countries with little arable land, where the area available for producing
food crops keeps decreasing: specifically, landlocked countries with few sources of
freshwater, deserts, and urbanized countries with more developed space than open
land. In this paper, a hydroponics cultivating stage is planned
and grown purposefully for the development of green feed expected for domestic
household purposes. The proposed work is to automate the process of hydroponics
through IoT implementation through which the cultivation can be monitored remotely
and can also begin correcting the environment to maintain stable growth. The aim of this
paper is to develop a novel farming method that may well be the future of
farming.
Keywords Modern agriculture ·Hydroponics ·Aquaculture ·Nutrient film
technique ·Internet of things ·Automation ·ThingSpeak ·Remote sensors
G. V. Danush Ranganath (B) · R. Hari Sri Rameasvar · A. Karthikeyan
School of Electrical Engineering, Vellore Institute of Technology, Vellore 632014, India
e-mail: danushranganathg.v2018@vitstudent.ac.in
R. Hari Sri Rameasvar
e-mail: harisri.rameasvarr2018@vitstudent.ac.in
A. Karthikeyan
e-mail: karthikeyan.arun@vit.ac.in
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_29
1 Introduction
The world has changed beyond recognition since the dawn of agriculture as an
organized practice of cultivation. The land has changed, the air has changed, and the
water has changed, and the traditional practice is riddled with obstacles. The water
has changed in that it is no longer as clean or as widely available as it used to be.
The air has changed to reflect the industrial practices that have shaped the modern
world, highlighting an ever-changing atmosphere, more often for the worse than for
the better [1]. The land has changed to satisfy the needs of the modern occupants of
this world so that they can now be called developed. All these things together have
led to the modern problem that puts agriculture at risk.
With modern problems come modern solutions and one such solution is hydro-
ponics. Hydroponics is the process of growing plants in sand, gravel, or water, the
surfaces which typically do not come to mind when one is thinking of farming [2].
With the world moving forward with its rapid urbanization and such development
leading to a reduction in the availability of arable lands, indoor farming seems like
the way to go.
As one of the most significant contributors to hydroponic cultivation on the planet,
China has the biggest hydroponics industry; its hydroponics cultivation surpassed
50 million tons in 2018, representing over 60% of the world's total [3]. Gulf nations
rely on imports to meet 90% of their food and water consumption needs and practice
hydroponics for everyday needs.
IoT devices and software applications are incorporated to send and display
system data online. The utilization of IoT-based aqua-farming automation in
this study ensures that the data obtained are more reliable: the information is
expected to be accurate because it is not collected manually but is instead handled
by the IoT system.
2 Background Study
2.1 Origin and History of Hydroponics
Hydroponics is a branch of farming under the art of horticulture in which agricultural-
ists grow medicinal and food crops for the world population. The term hydroponics
originates from two Greek expressions: hydro, referring to water, and ponos, meaning
labor [4]. The implementation of hydroponics farming was first introduced by Dr.
William Frederick Gericke, a California professor from Nebraska, USA, during early
1937.
2.2 What is Hydroponics
Hydroponics is a method of cultivation using nutrient-rich solutions in an aqueous
medium for growing crops, requiring no aid of soil or any other solid substrate
during irrigation and growth. The base idea of hydroponics is to promote plant
growth by providing flawless nutritional conditions to ensure excellent production
[5].
Hydroponic cultivation has surpassed the traditional way of cultivation in numerous
respects, including maximum crop yield using a smaller cultivation area than in
conventional farming [6]. The hydroponics system is also effective in conserving
water, as the drained water can be recycled for further cultivation. It has the added
advantage of reducing the number of physical laborers required by traditional methods.
This modern farming technique requires a large initial set-up cost; the costs involve
acquiring an educated team and building the setup for plant growth [7]. Hydroponics
is vulnerable to power outages, as it depends completely on electricity for continuous
monitoring and automation purposes. Hydroponic yields tackle pests well but tend to
fall prey to waterborne diseases [8].
3 Technical Specification
3.1 Methodology
This project is trying to simulate an automated system of hydroponics where the
environment and water logging are controlled automatically and monitored remotely,
essentially implementing automation in the modern agricultural practice [9]. To do
the same, an IoT system with embedded sensors to closely monitor the environmental
conditions is used in tandem with a platform to monitor the changes logged by all
the sensors in the system. This system is also connected to a third party application
to put out alerts in case of any mishaps or abnormal changes logged in the system
(Fig. 1).
3.2 Hydroponic System and Essential Components
The proposed hydroponic system is designed using the nutrient film technique
(NFT), a popular and highly successful technique in hydroponics. Polyvinyl chloride
(PVC) pipe structures are typically prescribed, being strong and easy to channel
water through while maintaining the logging and draining cycle of NFT systems, as
shown in Fig. 2.
Nutrients Container. The nutrient container is utilized to store the dissolved
nutrients in water that is provided to the NFT system. As a closed hydroponics
Fig. 1 Block diagram of the proposed system
Fig. 2 NFT hydroponic
system
system aims to recycle and reuse the excess water, the logging pipes are designed
such that the solution returns to it [10]. Plastic is ideally suggested as the
container material. The number of crops is important for the container setup and
is calculated using

Number of plants (P) = no. of pipes (p) × no. of shelves (s) × no. of holes (h)   (1)

Hence, as in Eq. (1), the number of plants grown here is 2 (p) × 1 (s) × 3 (h) = 6 (P).
Water Pump. The water and nutrient pumps are essential for optimal circulation in
the system. As this is an entry-level hydroponics system, a 12 V R385 diaphragm
motor with a rated liquid flow of 1 L/min is used. As the measured amount of water
in the water-logging pipe is around 1.5 L, the R385 water pump is a decent choice.
The same pump type is used to supply the nutrient solution and the pH solutions.
Nutrients and pH Solutions. The system is equipped with an Atlas pH sensor kit
that monitors the pH level of the solution for optimal plant growth. A pH range
between 4.5 and 7.5 is suggested as ideal for any hydroponic system. The system's
solution is combined with Micromix hydroponics nutrient solution along with
Radongrow pH-up and pH-down control solutions. The pH sensor is interfaced to the
Mega board through analog pin A5, and its output is monitored through the Arduino
IDE and ThingSpeak cloud using the conversion formulas below:

pH value (pH) = {[sensor voltage reading (V) × 5.0]/1024.0}/6   (2)

pH value (calibrated) = pH value (pH) × 3.5   (3)
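Eqs. (2) and (3) translate directly into a conversion routine. The minimal Python sketch below assumes, as in common Arduino pH-kit code, that the division by six in Eq. (2) averages six accumulated 10-bit ADC samples; that interpretation is an assumption rather than something stated in the paper.

```python
def ph_from_samples(adc_sum_of_six: int) -> float:
    """Convert the sum of six 10-bit ADC samples (0-1023 each) to a
    calibrated pH value following Eqs. (2) and (3)."""
    raw_ph = adc_sum_of_six * 5.0 / 1024.0 / 6.0   # Eq. (2): averaged voltage
    return raw_ph * 3.5                            # Eq. (3): calibration

print(round(ph_from_samples(6 * 410), 2))  # ~7.01, an illustrative reading
```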
Real-Time Clock Circuit. A DS3231 real-time clock (RTC) module based on the
inter-integrated circuit (I2C) protocol was included to monitor real-time activities
in the system. The module was programmed using an STMicroelectronics STM32F407
discovery board over the I2C protocol for real-time date and time activities in the
system, displayed on a 16 × 2 LCD. The RTC module includes a coin cell battery to
ride through power outages and keep track of time in the system.
Water Flow Path. The water flow path starting from the main water container runs
through PVC pipes. A YF-S201 water flow sensor is placed in the path to track and
maintain the quantity of water flowing into the pipes. The flow sensor is connected
to analog pin A8 of the microcontroller, which also provides the voltage required to
turn it on. The frequency of the sensor signal is used to calibrate the flow rate and
the amount of solution passing through. The following mathematical expressions
compute the flow rate and the total volume of the solution:

Sensor frequency (Hz) = 7.5 × Q, where Q is the flow rate (L/min)   (4)

Flow rate (L/h) = (sensor frequency × 60 min)/7.5   (5)

Liters = Q × time elapsed (s)/60 (s/min)   (6)

Liters = [frequency (pulses/s)/7.5] × time elapsed (s)/60   (7)

Liters = pulses/(7.5 × 60)   (8)
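Equations (4)–(8) likewise map onto a couple of helper functions. The sketch below checks that a 15 Hz pulse train sustained for one minute corresponds to a 2 L/min flow rate and 2 L of total volume, consistent with the 7.5 Hz-per-(L/min) constant above.

```python
def flow_rate_l_per_min(freq_hz: float) -> float:
    """Invert Eq. (4): Q (L/min) = sensor frequency (Hz) / 7.5."""
    return freq_hz / 7.5

def total_liters(pulse_count: int) -> float:
    """Eq. (8): accumulated pulses divided by (7.5 x 60) gives liters."""
    return pulse_count / (7.5 * 60)

print(flow_rate_l_per_min(15.0))  # 2.0 L/min
print(total_liters(15 * 60))      # 2.0 L after one minute at 15 Hz
```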
Actuators Control System. The proposed design uses an 8-channel 5 V relay
module connected to an Arduino Mega 2560 microcontroller to manage the water, pH,
and nutrient pumps. The relay board is actuated by the algorithm based on the
readings of the pH sensor, water-level float switch, DHT11 temperature sensor,
DS18B20 waterproof temperature probe, and BH1750 light intensity sensor module.
The water pumps are connected to the power supply when a high pulse from the
microcontroller is triggered according to the algorithm implemented in Fig. 3.

Fig. 3 Flow diagram of proposed system
Internet of Things (IoT) Platform. The Internet of things (IoT) is an evolving
technology that has contributed to virtually every field throughout the world. The
IoT feature in the proposed system provides remote-monitoring advantages for the
user. Real-time readings from the sensors, including temperature and humidity, pH,
water level, and light intensity, are sent to the server. The IoT feature is
implemented using the ESP8266 Wi-Fi module, which pushes data to the ThingSpeak
cloud using the MQ telemetry transport (MQTT) protocol; a hedged sketch of such a
publish call is given below.
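As an illustration of that publish path, here is a Python sketch using the paho-mqtt package. The broker host, topic layout ("channels/<ID>/publish"), and field-to-payload mapping follow ThingSpeak's documented MQTT interface, but the channel ID and device credentials are placeholders, and the real system publishes from the ESP8266 firmware rather than from Python.

```python
import paho.mqtt.publish as publish

# Hypothetical device credentials and channel ID -- replace with real ones.
CHANNEL_ID = "1234567"
CLIENT_ID = USERNAME = "DEVICE_ID"
PASSWORD = "DEVICE_PASSWORD"

def push_readings(temp_c: float, humidity: float, ph: float):
    """Publish one update; the fieldN order must match the channel's fields."""
    payload = f"field1={temp_c}&field2={humidity}&field3={ph}"
    publish.single(
        topic=f"channels/{CHANNEL_ID}/publish",
        payload=payload,
        hostname="mqtt3.thingspeak.com",
        port=1883,
        client_id=CLIENT_ID,
        auth={"username": USERNAME, "password": PASSWORD},
    )

push_readings(24.5, 61.0, 6.8)
```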
Figure 4 is the schematic diagram of the whole electronics circuit implemented
for real-time monitoring and automation using Fritzing software tool.
4 Project Implementation and Results
4.1 Hardware Setup
In this paper, efforts have been made to implement an actual hardware setup of the
proposed hydroponics system, as shown in Fig. 5. The results of the implemented
system have been successfully verified against similar works on hydroponics. The
designed system has been harvested with mint, producing maximum yield. The process
of growing crops in hydroponic systems is slightly longer than in the traditional
method, but the quality of the food crops grown compensates for this.

Fig. 4 Circuit diagram of the proposed system
4.2 ThingSpeak Dashboard
The proposed NFT-based hydroponics system was designed, developed, and tested
with various real-time sensor parameters, including potential of hydrogen (pH),
temperature, humidity, water temperature, water flow rate, and water float-level
switch state, which were continuously monitored and updated on the ThingSpeak cloud
for further analysis and remote monitoring. Figure 6 shows the results from the
ThingSpeak Web cloud platform.
Fig. 5 Hardware setup of the system
Fig. 6 Sample graphs of ThingSpeak channel: a temperature, b humidity, c water temperature, d
pH level, e water flow, f water level
Fig. 7 Fresh mint crop
cultivation
4.3 Mint Cultivation
The developed hydroponics system was initially used to cultivate mint, as it is an
ideal crop to grow in hydroponics. The system parameters differ for every crop
harvested in the system. Figure 7 shows the outcome, based on the standard values,
from the developed hydroponics system with fresh yields.
5 Conclusion and Future Work
The ultimate aim of this project is to understand the working of hydroponics
using NFT and to automate the system through IoT. The goal has been achieved by
constructing the system from the ground up with reference to existing models and
creating an innovative automated system. Through this project, we learned the
optimum growing conditions for various plants and the right way to nourish a plant
using only water.
Continuing from the prototype, we can improve in the areas of scale, temperature
control, and power consumption. Power consumption could be addressed by implementing
renewable power resources, such as solar panels, instead of traditional DC power
outlets. The scale of the system can be developed from a simple single-level
structure into multiple shelves for vertical hydroponics systems.
References
1. Muralimohan G, Arjun SV, Sakthivel G (2021) Design and development of IoT based hydro-
ponic farming setup for production of green fodder. Nat Volatiles Essent Oils 8(4):4325–4340
2. Chowdhury MEH, Khandakar A, Ahmed S, Al Khuzaei F, Hamdalla J, Haque F, Reaz MBI,
Shafei AA, Emadi NA (2020) Design, construction and testing of IoT based automated indoor
vertical hydroponics farming test-bed in Qatar. MDPI Sens 20:5637
3. Ullah A, Aktar S, Sutar N, Kabir R, Hossain A (2019) Cost effective smart hydroponic
monitoring and controlling system using IoT. Intell Control Autom 10:142–154
4. Khan S, Purohit A, Vadsari N (2021) Hydroponics: current and future state of the art in farming.
J Plant Nutr 44(10):1515–1538
5. Sharma N, Acharya S, Kumar K, Singh N (2019) Hydroponics as an advanced technique for
vegetable production: an overview. J Soil Water Conserv 17(4):364–371
6. Sambo P, Nicoletto C, Giro A, Pii Y, Valentinuzzi F, Mimmo T, Lugli P, Orzes G, Mazzetto
F, Astolfi S, Terzano R, Cesco S (2019) Hydroponic solutions for soilless production systems:
issues and opportunities in a smart agriculture perspective. Front Plant Sci 10:923
7. Prasetia Y, Putrada AG, Rakhmatsyah A (2021) Evaluation of IoT-based grow light automation
on hydroponic plant growth. Jurnal Ilmiah Teknik Elektro Komputer dan Informatika (JITEKI)
7(2):314–325
8. Wei Y, Li W, An D, Li D, Jiao Y, Wei Q (2019) Equipment and intelligent control system in
aquaponics: a review. IEEE Access 7:169306–169326
9. Rajkumar R, Dharmaraj R (2018) A novel approach for smart hydroponic farming using IoT.
Int J Eng Res Comput Sci Eng (IJERCSE) 5(5)
10. Hadinata A, Mashoedah (2021) Internet of things-based hydroponic: literature review. IOP J
Phys: Conf Series 2111:012014
Solution Approach for Detection of Stock
Price Manipulation by Market Operators
Yogesh Kakde , Ganesh Chavan , Basant Sah , and Apoorva Sen
Abstract Nowadays, many so-called stock analysts send tips via SMS, e-mail, and
social media, giving targets for stocks of very poor quality. In the market, these
are called pump and dump schemes, in which "operators" or "manipulators" increase
the price of a stock through various strategies. The price increase attracts retail
investors to purchase that stock. When the stock price crosses the targets set by
the manipulators, they sell out, and the public is left holding stock whose price
then decreases suddenly. In this paper, we present a solution approach that can be
implemented to detect such stock price manipulation and avoid such malicious
activity. In our solution approach, we suggest ideas and criteria that may be used
to build a model based on data analytics and machine learning, which can return a
list of stocks expected to be manipulated by operators. This paper also proposes the
data analytics and machine learning models that may be used when implementing the
suggested solution approach.
Keywords Data analytics ·Machine learning ·Stock market ·Stock price
manipulation ·Pumps and dumps
1 Introduction
Sometimes, there is a sharp pump or dump in the price of specific stocks, and there
are chances that these moves are driven by operators or manipulators. In other terms,
the prices of those specific stocks have been moved or influenced by the stock market
operators. As an example, consider the share price of Urja Global
Y. Kakde (B) · G. Chavan · B. Sah
KL University, Guntur, AP, India
e-mail: ykakde@gmail.com
B. Sah
e-mail: basantbitmtech2008@kluniversity.in
A. Sen
Medi-Caps University, Indore, MP, India
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_30
stock: it increased from around Re. 1 in October 2017 to Rs. 11.43 in January 2018
and was down to Rs. 2 by October 2018. It started rising again in November 2018 but
has been on a long decline since then, to Rs. 1.75 as of 24 December 2019. This
manipulation was done by operators.
Let us first understand who market operators are. Several brokers, speculators,
different types of firms, or sometimes even company insiders may work together as
a syndicate to move a stock price for their personal agenda or profit. This syndicate,
also referred to as the stock market operators, works together to move stock prices
rapidly, creating a frenzy among investors.
This is mostly done to benefit from the sharp and large price difference created in a
short period, so that higher profits may be generated. They target small and mid-cap
stocks, as these are easier to manipulate and influence.
1.1 Strategies to Influence Investors' Decisions
Order Book Manipulation
Different buy and sell orders for a specific stock are listed in the order book,
which reflects the number of shares that people are willing to buy and sell at
various prices. A soft copy of the order book is available and accessible to traders.
Retail investors can access only 5 data points, whereas stock market operators may
access many more (Fig. 1).
Fig. 1 Number of market manipulation and price rigging cases taken up for investigation (Image source: statista.com)
Intraday Trading Tricks
Buying and selling stocks within a single trading day is called intraday trading.
It is done to earn profits by taking advantage of price movements. The orders are
squared off by the end of the day, hence the name intraday trading.
1.2 How Does Manipulation Work?
Manipulation of stock prices can be done by various means. A decline in the price of
a share can be achieved by placing a large number of small orders at a price that is
lower than the current market price. Investors then feel that something is going
wrong in the company, and this negative sentiment pushes them to sell the stock,
which drives the price even lower. Another way, used to increase the value of a
share, is to place an equal number of buy and sell orders for the same stock, but
through different stock brokers.
The major techniques of market manipulation are:
Pump and Dump. This is used frequently to change the price of a stock artificially.
The manipulator sells out the stocks at the inflated price, and the rest are left
with an overestimated security that later goes down.
Poop and Scoop. This is not as frequently used as pump and dump. In this technique,
the price of the stock of a medium or large-cap company is decreased artificially.
The manipulator then buys the undervalued stock, which returns a profit.
2 Literature Survey
The discussion regarding internal attacks is most important for our objective.
Internal attacks aim to make profits by manipulating trading processes, e.g.,
spoofing, quote stuffing, layering, and others, which are the specific focus of this
paper. Stock exchanges deploy different types of proprietary fraudulent-activity
detectors that analyse the time series of a trader's activities, or the activity of a
particular stock, to flag potentially malicious transactions, while human analysts
probe the flagged transactions further [1].
Siering [3] examines the phenomenon of stock touting during pump and dump
campaigns, in which deceivers advertise stocks to profit from an increased price
level. Victor and Hagemann [4] quantify and detect pump and dump schemes that are
coordinated through Telegram chats and executed on Binance, one of the most popular
cryptocurrency exchanges; they detail how pumps are organised on Telegram and
quantify the properties of 149 confirmed events with respect to market
capitalisation, trading volume, price impact, and profitability.
Comerton-Forde and Putniņš [6] find that stocks with high levels of information
asymmetry and mid to low levels of liquidity are most likely to be manipulated, and
that a significant proportion of manipulation occurs on month- and quarter-end days.
The work of Kakde et al. [5] motivates our implementation strategy design. Wang
et al. [7] design a novel RNN-based ensemble learning (RNN-EL) framework that
combines trade-based features derived from trading records with characteristic
features of listed companies to effectively detect stock price manipulation
activities.
In a related insider-trading study, samples from 2007 to 2017 and corresponding
non-insider-trading samples are collected; the proposed method is trained with a
GBDT whose initial parameters are optimised by differential evolution (DE) [6], and
out-of-sample data are classified by the trained GBDT–DE model, whose performance is
then evaluated.
3 Proposed Methodology
We propose a solution model, which may be called an engine, to detect stocks that
are being manipulated. This engine detects manipulated stocks at two levels. At each
level, we perform data collection and apply statistical analysis.
The flow of our solution approach is shown in Fig. 2.
Level 1 Solution Approach. At this level, we consider some features of the stock
and, by applying statistical analysis, list the stocks that have been or may have
been manipulated.
Level 2 Solution Approach. At this level, we consider only those stocks that were
filtered out after the level 1 operations, and we try to identify whether the related
content on the Web has increased or not.
3.1 Detailed Discussion
Level 1. At this level, we perform only basic analysis, which may include features
required for fundamental and technical analysis [7]. For data collection, we possibly
need only one source. We can collect some statistics from the graph of the stock
price. These records for any stock may be given as input to a data analytics or
machine learning model, e.g., a decision tree, SVM, etc., which in return may reveal
an unexpected pump or dump in the price of the stock.
At this level, we may collect the following:
Fig. 2 Flow of solution
approach
Sudden changes (hikes) in the stock price crossing some threshold even though the
fundamentals of the company have not changed.
Access to order books for the detection of stock price manipulation.
Detection of bulk buying by any single investor.
Sudden hikes in demand for any stock.
We then check whether there is enough evidence for the stock price hike. The
evidence we may consider here includes promoter holdings, funding received,
inflation, market sentiment, etc.
Level 2. Once we figure out the possible list of stocks that have been or may be
manipulated, we proceed to level 2 analysis. This level of analysis is computationally
more complex than level 1. In level 1, we may have only one or very few sources of
data, but here in level 2, we need to access the Web.
At this level, we try to collect the following:
Rapid growth in keywords drawn from:
Social media content: access to openly broadcast messages in groups
News articles: small news media organisations
Video content on YouTube or other sites.
Detection of stock purchases by a large number of investors without a corresponding
change in the company's fundamentals (more complex than the bulk-buying point
discussed in level 1).
Data from the last 15 days will be analysed.
3.2 Proposed Implementation Methods
Level 1. At this level, the input will be all possible stock which could be manipulated.
We can choose the basic parameters or some more parameters. After choosing the
required parameter, simply use any statistical tools to process the dataset. We suggest
using any one machine learning model from logistic regression, SVM, and decision
tree.
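As a minimal sketch of such a level 1 screen (not the paper's exact criteria), the following flags days where the return deviates sharply from its rolling behaviour while the volume also spikes; the window size, deviation threshold, and volume multiplier are illustrative assumptions.

```python
import pandas as pd

def level1_flags(prices: pd.Series, volumes: pd.Series,
                 k: float = 3.0, window: int = 30) -> pd.Series:
    """Flag days with an abnormal return (beyond k rolling std-devs)
    accompanied by a volume spike -- a crude level 1 manipulation screen."""
    returns = prices.pct_change()
    z = (returns - returns.rolling(window).mean()) / returns.rolling(window).std()
    volume_spike = volumes > 2 * volumes.rolling(window).mean()
    return (z.abs() > k) & volume_spike

# Usage (df has 'close' and 'volume' columns indexed by date):
# suspicious_days = df[level1_flags(df["close"], df["volume"])]
```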
Level 2. At this level, we need to collect data from the Web, for which our search
engine uses a Web crawler [8]. A Web crawler is an Internet bot that systematically
browses the World Wide Web and is typically operated by search engines for the
purpose of Web indexing. The Web crawler indexes Web content that contains a specific
keyword, which could be anything related to the stock we suspect of being manipulated
[9].
The crawler will collect the counts, URL addresses, timestamps, etc., where the
specific keyword is found and will maintain a record for the required time span,
which could be, e.g., 15, 30, or 60 days; [10] suggests 90 days as a good time span
to consider [11]. A sketch of this keyword-counting step is shown below.
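The following Python sketch shows only the keyword-counting step, using the requests and BeautifulSoup libraries; a production crawler would additionally manage a URL frontier, respect robots.txt, and persist the records over the chosen time span. All URLs and keywords are illustrative.

```python
import time
import requests
from bs4 import BeautifulSoup

def count_keyword(urls, keyword):
    """Record (url, count, timestamp) for pages mentioning the keyword,
    as the proposed crawler would log over the chosen time span."""
    records = []
    for url in urls:
        try:
            html = requests.get(url, timeout=10).text
        except requests.RequestException:
            continue  # skip unreachable pages
        text = BeautifulSoup(html, "html.parser").get_text().lower()
        hits = text.count(keyword.lower())
        if hits:
            records.append({"url": url, "count": hits, "ts": time.time()})
    return records

# e.g. count_keyword(["https://example.com/news"], "XYZ stock")
```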
After data collection, we propose implementing an advanced data analytics model,
which may be a neural network model [12] or any model that can process
multidimensional data. The search engine will run continuously to collect data from
the Web, and after each fixed time span, the analytics will be applied periodically
to the collected data [13].
4 Summary and Conclusion
This paper focuses on providing a solution method that can be used directly and
implemented with any data analytics tool, or employed with a machine learning model.
Stock manipulation is very common nowadays in the stock market, and few or no
solutions have been implemented to warn investors in advance. This solution may be
used by stock exchanges and brokers, who can periodically warn their investors,
especially retail investors, to beware of such malicious activity. The paper provides
the solution at two levels, which gives the flexibility to implement level 1 only, as
the cost of implementing and maintaining the level 2 solution is higher.
5 Future Work and Limitations
The methods discussed in this paper are sufficient to detect stocks that have been
manipulated. In future, we can add steps not only for detection but also for
prevention of stock price manipulation. To prevent manipulation, we may obtain useful
outputs such as a list of investors who purchased a particular stock in bulk, against
whom legal action can be taken, and we can identify the media organisations,
bloggers, and video bloggers involved in spreading fake or bulk information. Stock
brokers can also add constraints on buying stock in bulk. We can also consider more
parameters for greater accuracy, such as changes in the CRR, repo rate, and reverse
repo rate by the central bank (e.g., the RBI in India), pandemic situations, and war
conditions, because changes in these parameters may also affect stock prices.
Including these parameters will strengthen our proposed engine.
As discussed above, we may collect a large amount of data from various sources over
a predefined period and then apply analytics to those data. Collecting data over the
long run and applying analytics may not be suitable for detecting operator
manipulation in the case of intraday trading. This engine is limited to finding
operator manipulation over time; as far as intraday trading is concerned, price
manipulation may not be detected by it.
Bibliography
1. Kalra B et al (2021) Cooperative monitoring of malicious activity in stock exchanges. Pacific-
Asia conference on knowledge discovery and data mining. Springer, Cham
2. Nti IK, Adekoya AF, Weyori BA (2020) A systematic review of fundamental and technical
analysis of stock market predictions. Artif Intell Rev 53:3007–3057
3. Siering M (2019) The economics of stock touting during Internet-based pump and dump
campaigns. Inf Syst J 29(2):456–483
4. Victor F, Hagemann T (2019) Cryptocurrency pump and dump schemes: quantification and
detection. In: 2019 international conference on data mining workshops (ICDMW). IEEE
5. Kakde Y, Rawat M, Dangra J (2015) Implementation of efficient extraction of deep web by
applying structured queries and maintaining repository freshness. Int J Adv Res Comput Sci
Softw Eng 5:1266–1271
6. Comerton-Forde C, Putniņš TJ (2014) Stock price manipulation: prevalence and determinants.
Rev Finance 18(1):23–66
7. Wang Q et al (2019) Enhancing intraday stock price manipulation detection by leveraging
recurrent neural networks with ensemble learning. Neurocomputing 347:46–58
8. Kakde Y, Agrawal S (2018) Predicting survival on titanic by applying exploratory data analytics
and machine learning techniques. Int J Comput Appl 179(44):32–38. https://doi.org/10.5120/
ijca2018917094
9. Heydon A, Najork M (1999) Mercator: a scalable, extensible web crawler. World Wide Web
2(4):219–229
10. He J, Xia H, Zhao Y (2021) "Pump-and-dump" through media tone? Institutional trading
strategies during corporate litigation (December 6, 2021)
11. Wang Z et al (2022) Design and implementation of security vulnerability sharing platform
based on web crawler. In: Proceedings of the 11th international conference on computer
engineering and networks. Springer, Singapore
12. Skillicorn D, Nam D (2022) Detecting stock market manipulation from online forums
13. Kakde Y, Bothe N, Paul A (2019) Real life implementation of object detection and classification
using deep learning and robotic arm. SSRN Electron J 2–9. https://doi.org/10.2139/ssrn.3372199
Cancer Cell Detection and Classification
from Digital Whole Slide Image
Anil B. Gavade , Rajendra B. Nerli , Shridhar Ghagane ,
Priyanka A. Gavade, and Venkata Siva Prasad Bhagavatula
Abstract The World Health Organisation has identified cancer as one of the fore-
most causes of death globally which reports that nearly one in six deaths is due
to cancer. Hence, an early and correct diagnosis is required to assist doctors in
selecting the accurate and best treatment option for the patient. Pathological data
have huge tumour information that can be used to diagnose cancer. Digitizing patho-
logical data into images and its analysis using Deep learning applications will be
a significant contribution to clinical testing. Due to advancements in technology,
artificial intelligence (AI) and digital pathology can now be combined allowing for
image-based diagnosis. This study uses a residual network (ResNet-50) convolutional
neural network (CNN), pre-trained on the ImageNet dataset, to train and
categorise lung histopathology images into non-cancerous, lung adenocarcinoma,
and lung squamous cell carcinoma delivering an accuracy of 98.9%. Experimenta-
tion results show that the ResNet-50 model delivers finer classification results when
compared to state-of-the-art methods.
A. B. Gavade (B)
Department of E&C, KLS Gogte Institute of Technology, Belagavi, Karnataka, India
e-mail: anil.gavade@gmail.com
R. B. Nerli
Department of Urology, JN Medical College, KLE Academy of Higher Education and Research
(Deemed-to-Be-University), Belagavi, Karnataka, India
S. Ghagane
Department of Biotechnology, KAHER’s Dr. Prabhakar Kore Basic Science Research Center, V.
K. Institute of Dental Sciences Campus, Belagavi, Karnataka, India
P. A. Gavade
Department of Computer Science and Engineering, KLE Tech University Dr. M. S. Sheshgiri
College of Engineering and Technology, Belagavi, Karnataka, India
V. S. P. Bhagavatula
Medtronic, Hyderabad, India
e-mail: Venkata.siva.prasad.bhagavatula@medtronic.com
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023
K. A. Ogudo et al. (eds.), Smart Technologies in Data Science and Communication,
Lecture Notes in Networks and Systems 558,
https://doi.org/10.1007/978-981-19-6880-8_31
Keywords Histopathological image ·Deep learning ·Convolutional neural
network ·Classification ·Artificial intelligence ·Residual networks ·Graphic
processing unit
1 Introduction
Cancer is a disease with a very low survival rate, accounting for nearly 10 million
deaths in 2020, the most common being breast and lung cancer. Due to the high
recurrence and death rates, treatment is lengthy and costly. Early-stage cancer
prognosis is not easy, owing to the lack of diagnostic tools that are critical in
clinical cancer research. Accurate early cancer identification and prognosis are
crucial for improving the patient's survival rate.
Pathology is a branch of medical science that studies and diagnoses disease by
analysing surgically removed tissues, organs, fluids and in some instances, even the
entire body. Pathology also encompasses the closely related scientific study of disease
processes, which examines the causes, mechanisms, and consequences of illness.
Under a microscope, the pattern of tissue samples is examined to help determine
whether a sample is malignant. This requires substantial time and labour, reducing
workflow efficiency; digitising this process will therefore increase work efficiency
and speed up diagnosis. This can be achieved by digital
pathology, which is a dynamic and image-based platform that allows pathology data
generated from a digitised glass slide to be acquired, managed, and interpreted.
With the practice of whole slide imaging, glass slides can be converted into digital
slides that may be viewed, managed, and analysed on a computer. Development of
AI and machine learning leads to efficient and less expensive disease diagnosis,
prognosis, and prediction systems.
Deep learning extracts biomarkers directly from histology images, and much research
on cancer histology image analysis builds on it. These algorithms are designed to
automate workflows and can be used for segmentation and classification of whole
slide images.
In digital pathology, first, we classify whole slide images as cancerous or non-
cancerous; further, we use segmentation to identify the size and location of the cancer,
all this is achieved through training the model on convolutional neural networks
(CNNs). CNN is a type of deep neural network that is commonly used to analyse
visual data and has a pre-trained learning model for image classification. The model
also contains nearly 23 million trainable parameters, indicating a deep architecture.
ResNet is one such CNN that can be used for classification of image-based datasets.
ResNet-50 is a 50-layer deep CNN that can load a network model pre-trained on the
ImageNet database of over a million images. There are 48 convolution layers, one
max-pool layer, and one average-pool layer in the model. The ResNet-50 model has 5
stages, each built from residual blocks. These bottleneck residual blocks have three
layers, using 1 × 1 and 3 × 3 convolutions, and each convolutional layer is followed
by a batch normalisation layer and a ReLU activation. In traditional neural networks
each layer feeds only into the next, but in ResNet each layer feeds into the next
layer and also onto layers 2–3 hops away; these skip paths are known as identity
connections. A minimal sketch of such a block is given below.
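The bottleneck block and its identity connection can be sketched in a few lines of Keras. This is an illustrative reconstruction of the pattern just described, not the paper's code; the projection shortcut is one common way to handle mismatched channel widths.

```python
import tensorflow as tf
from tensorflow.keras import layers

def residual_block(x, filters: int):
    """Bottleneck block: 1x1 -> 3x3 -> 1x1 convolutions, each followed by
    batch normalisation, with the identity (skip) connection added back
    before the final ReLU."""
    shortcut = x
    y = layers.Conv2D(filters, 1, padding="same")(x)
    y = layers.BatchNormalization()(y)
    y = layers.ReLU()(y)
    y = layers.Conv2D(filters, 3, padding="same")(y)
    y = layers.BatchNormalization()(y)
    y = layers.ReLU()(y)
    y = layers.Conv2D(4 * filters, 1, padding="same")(y)
    y = layers.BatchNormalization()(y)
    if shortcut.shape[-1] != 4 * filters:        # project when widths differ
        shortcut = layers.Conv2D(4 * filters, 1)(shortcut)
    return layers.ReLU()(layers.Add()([y, shortcut]))

inputs = tf.keras.Input(shape=(56, 56, 64))
outputs = residual_block(inputs, filters=64)
print(tf.keras.Model(inputs, outputs).output_shape)  # (None, 56, 56, 256)
```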
2 Contributions
This paper implements cancer detection and classification using digital whole
slide images and deep neural network (CNN) architectures. From the literature, it is
observed that the CNN is considered one of the best pre-trained model families for
large-scale image classification applications, and ResNet-50 is widely used in
medical image classification. The paper is divided into literature review,
implementation, performance comparison analysis, results, and conclusion.
3 Related Works
Image segmentation is one of the most important and challenging tasks in the area of
medical image processing. For nucleus segmentation, Shyam Lal et al. [1] proposed an
encoder–decoder style U-Net model with an attention-gating mechanism and a
dimension-wise pyramid pooling approach. The model was evaluated on kidney and breast
histopathology images, yielding an F1-score of 0.9294 and an average Jaccard index
(AJI) of 0.8688 on a publicly available kidney dataset, and an F1-score of 0.8243 and
an AJI of 0.7039 on a breast dataset.
Zitao Zeng et al. [2] presented a model that uses a multi-task learning technique to
segment nuclei and cell contours simultaneously. This model delivered an F1-score of
0.8278 and a dice score of 0.7844.
Amit Kumar et al. [3] proposed a separable convolution pyramid pooling network with
an encoder–decoder. Evaluation on kidney and breast datasets gave F1-scores of 0.9203
and 0.8168 and AJIs of 0.8592 and 0.6998, respectively.
Shyam Lal et al. [4] presented an architecture with three blocks: a robust residual
block, a bottleneck block, and an attention decoder block. The robust residual block
is proposed to extract high-level semantic maps, while the attention block improves
object localisation. During segmentation, the model claims to be more precise in
tackling shape variability and touching-nuclei challenges.
Qasem et al. [5] used the ResNet-50 CNN model pre-trained on ImageNet to categorise a
dataset into benign and malignant categories. Compared with various CNN models, the
proposed method achieved an accuracy of 99.10%, outperforming state-of-the-art
methods.
Pin Wang et al. [6] gave an architecture for automatic segmentation and
classification of breast histology images. The method uses wavelet transformation and
multi-scale region growing to detect regions of interest, and morphological
operations along with a CSS detection algorithm to separate overlapped cells.
Anmol Verma et al. [7] proposed a model for prediction and classification of breast
cancer histopathology images. For detection, the model was influenced by the IRRCNN
algorithm, and for classification by WSI-Net. The accuracies for detecting and
classifying the cancer were 95.25% and 80.43%, respectively, outperforming WSI-Net.
Muhammed Talo [8] presented deep learning ResNet-50 and DenseNet-161 models to
classify histopathology images automatically, achieving an accuracy of 97.89% on
greyscale and colour images, and a classification accuracy of 98.87% on the 24-class
KimiaPath24 dataset.
Yangqin Feng et al. [9] implemented cell nuclei classification on breast cancer
histopathology images using a stacked denoising autoencoder; compared with 8
different techniques, their classification accuracy of 98.27% outperformed the
others.
Yun Jiang et al. [10] presented a small SE-ResNet module that combines a residual
module with a squeeze-and-excitation block. The model classifies histopathological
images of breast cancer into benign, malignant, and eight subgroups. For binary
classification, the achieved accuracy ranges between 98.87% and 99.34%, and between
90.66% and 93.81% for multi-class classification.
Krithiga and Geetha [11] published a review on detection, segmentation, and
classification of breast histopathology images. The study provides an overview of
tissue preparation, stained image analysis, preprocessing techniques, methods of
segmentation, methods of feature extraction, feature selection, and classification.
This work drew attention to several algorithms and methodologies, as well as listing
the performance of various models with various characteristics such as accuracy,
specificity, sensitivity, and F1-score.
Kourou et al. [12] present a review of cancer prognosis and prediction using machine
learning; the literature considered covers different cancers, such as oral, brain,
colon, and cervical cancer, and addresses cancer detection and classification using
ANN, SVM, and graph-based SSL algorithms.
Mesut et al. [13] proposed a model that includes an attention module, the hypercolumn
technique, and a residual block for improved cancer detection. When evaluated on the
BreakHis dataset, this model achieved an accuracy of 98.80%.
Soulami et al. [14] used the DDSM and INbreast mammographic databases for automatic
segmentation and classification of breast cancer, proposing an end-to-end U-Net
model; results are assessed with evaluation metrics such as IoU, AUC, F1, and dice
coefficient.
Ting-Wei Chiu et al. [15] addressed localisation of lung nodules and carcinoma with
U-Net and 2D U-Net segmentation architectures. The evaluation metrics used were dice
coefficient, accuracy, sensitivity, and specificity, with comparisons between data
without preprocessing (mono input positive) and with preprocessing (mono input
negative).
Devvi Sarwinda et al. [16] implemented CNN ResNet architectures ranging from 18 to
152 layers. A colorectal gland image dataset was tested on ResNet-18 and ResNet-50,
with different training and testing ratios considered for results verification, and
classification results assessed with accuracy, sensitivity, and specificity. It was
observed that a higher number of layers takes more time for computation; the final
inference was that, with fewer layers in the ResNet architecture, it is possible to
achieve good classification accuracy in less time.
Brij Rokad and Nagarajan [17] demonstrated skin cancer detection and classification
using a deep residual network (ResNet) on the International Skin Imaging
Collaboration (ISIC)-2017 challenge skin dataset of around 2000 dermoscopic lesion
images (374 melanoma, 254 seborrheic keratosis, and 1372 benign nevus), achieving a
classification accuracy of 77%.
Jiazhi Liang [18] implemented a CNN ResNet-110 V1 for classification of the CIFAR-10
dataset; different training and testing combinations were used for experimentation,
and the highest accuracy was observed at 110 layers.
Yasin Yari et al. [19] developed an effective training-learning architecture
consisting of fully connected classifier and input layers combined with ResNet-50 and
DenseNet-121 models. Images at different magnifications were employed to test the
proposed techniques against 8 other techniques, with binary and multi-label
classification algorithms tested on histology datasets.
Varsha Prakash and Smitha Vas [20] reviewed lung cancer detection using modalities
such as X-ray and CT images; the paper gives an overview of segmentation and nodule
extraction and nodule classification, with emphasis on CNN data augmentation and
nodule detection.
Shyam Lal et al. [21] demonstrated segmentation of nucleic cells from stained
histological slides; the implemented algorithm is compared with four different
methods for performance assessment. The algorithm was experimented on two different
datasets, the Stephan Wienert dataset and liver tissue datasets from KMC Mangalore.
Results are compared using precision, recall, and F1 as quality metrics, and the
implementation outperforms the other four algorithms.
Hao Dong et al. [22] used the BRATS 2015 dataset to develop a U-Net CNN algorithm
that segments patient-specific brain tumours without manual intervention, potentially
enabling objective lesion assessment for clinical tasks such as diagnosis, treatment
planning, and monitoring.
Amitojdeep et al. [23] reviewed several diseases across different modalities and the
classification of regions of interest from radiological modalities, including brain
MRI, X-ray, cardiac MRI, CT, mammography, and lung CT. The review focuses on
classification and segmentation architectures with CNNs and their derivative models,
SVMs, and hybrid CNNs.
Ves a l et a l . [24] provide a performance comparison between ResNet-50 and
Inception-V3 which have been pre-trained on ImageNet dataset and then trained
on BACH dataset. A transfer learning-based approach is been proposed, in which
ResNet-50 achieves 97.50% accuracy outperforming Inception-V3 with 91.25%
accuracy.
Nur Syahmi Ismail and Cheab Sovuthy [25] addressed breast cancer detection on the
IRMA dataset, comparing three different techniques: VGG-16, ResNet-50, and the
implementation by Q. Zhang. The aim is to classify tumours as normal or abnormal;
results are assessed with precision, accuracy, and recall as quality metrics, and
VGG-16 was observed to outperform the other two techniques.
Asmaa Hekal et al. [26] developed a deep learning model for breast cancer detection
and classification using the CBIS-DDSM ROI dataset. The CNN model is refined by
substituting the last fully connected layer of the pre-trained model with a shallow
SVM classifier, leading to improved tumour classification.
Hao Zhang et al. [27] propose a ResNet model for detection of metastatic cancer;
test-time augmentation is employed to make the model more robust and to improve
detection accuracy.
Saber et al. [28] proposed a DL model for enhancing classification results using
transfer learning on the MIAS dataset. The VGG-16 model achieved the best accuracy,
98.96%, compared with ResNet-50, Inception-V3, VGG-19, and Inception-V2 ResNet.
Ahmad et al. [29] present the use of transfer learning for classification of breast
cancer. The ResNet-50 model achieved 85% accuracy on image-wise classification and
83.60% on patch-wise classification on the BreakHis dataset.
Shallu Sharma and Rajesh Mehra [30] compared hand-crafted features fed to conventional classifiers against a transfer learning baseline in which a pre-trained CNN is used for feature extraction followed by classification with a CNN classifier. Breast cancer histopathology datasets at different magnifications (40X, 100X, 200X, and 400X) were used for experimentation. It was observed that the pre-trained model as feature extractor outperformed the hand-crafted features with conventional classifiers across all magnification levels.
4 Dataset and Computing Machine Details
In the implementation, only the lung organ subset of the dataset [31] is used, with a total of 3 classes and 15,000 images, each a 768 × 768 patch in colour JPEG format. The full dataset comprises:
(1) Lung: benign tissue (5000 samples), adenocarcinoma (5000 samples), and squamous cell carcinoma (5000 samples)
(2) Colon: benign tissue (5000 samples) and adenocarcinoma (5000 samples).
The complete dataset has 25,000 histopathological images divided into five classes. All images are 768 × 768 pixels in size and saved as JPEGs; the 25,000 images were obtained by augmenting an original set of 750 lung tissue images and 500 colon tissue images. The algorithm was implemented on a Dell Precision Tower 5810 workstation with a Xeon CPU, 512 GB SSD, 32 GB RAM, and an 8 GB NVIDIA Quadro P4000 GPU.
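As a rough illustration, assuming the public LC25000 folder layout (lung_n, lung_aca, lung_scc) and a local path chosen here purely for the example, the per-class image counts can be verified before training:

# Sketch: verifying the lung subset of the dataset [31] before training.
# The local path is an assumption; the class folder names follow the
# public LC25000 layout (lung_n, lung_aca, lung_scc).
from pathlib import Path

DATA_DIR = Path("lung_colon_image_set/lung_image_sets")  # assumed path

for class_dir in sorted(DATA_DIR.iterdir()):
    if class_dir.is_dir():
        n_images = sum(1 for _ in class_dir.glob("*.jpeg"))
        print(f"{class_dir.name}: {n_images} images")  # expect 5000 per class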
Fig. 1 Whole slide image cancer classification
5 Implementation Details
The implementation block diagram is represented in Fig. 1, which involves the datasets, the ResNet-50 CNN model, and a test image. The model is trained, tested, and validated with different percentage combinations of the dataset, and it is observed that the model performs most efficiently with the 70:15:15 ratio combination. Algorithm 1 explains the procedure for cancer classification. Samples of digital histology images are shown in Fig. 2, consisting of non-cancerous tissue, malignant tissue of type lung adenocarcinoma, and malignant tissue of type lung squamous cell carcinoma.
Fig. 2 Biopsy sample data a non-cancerous tissue, b malignant tissue of type lung adenocarcinoma, c malignant tissue of type lung squamous cell carcinoma

Algorithm 1: Cancer cell detection and classification
Input: Digital whole slide image
Output: Non-cancerous tissue / malignant tissue of type lung adenocarcinoma / malignant tissue of type lung squamous cell carcinoma
Step 1: Importing required libraries and loading lung cancer dataset
Step 2: Data pre-processing and data splitting with a balanced train-test split
Step 3: Loading the ResNet-50 model
Step 4: Defining the final output layer with 3 units, matching the number of classification folders
Step 5: Compiling the model with categorical cross-entropy loss and the Adam optimiser
Step 6: Executing the model
Step 7: Plotting confusion matrix
Step 8: Evaluating the model
Step 9: Prediction and classification of lung cancer into its subtypes
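A minimal TensorFlow/Keras sketch of Steps 1-6 follows. The input size, split fractions, seed, epoch count, and directory path are illustrative assumptions rather than the authors' exact configuration, and only a single train/validation split is shown for brevity (the paper's 70:15:15 split would further divide the held-out portion):

# Sketch of Algorithm 1, Steps 1-6, in TensorFlow/Keras.
# Image size, split, epochs, and paths are assumptions for illustration.
import tensorflow as tf
from tensorflow.keras import layers

IMG_SIZE = (224, 224)  # assumed; ResNet-50's conventional input size

# Steps 1-2: load the lung dataset from class-named folders.
train_ds = tf.keras.utils.image_dataset_from_directory(
    "lung_image_sets", validation_split=0.3, subset="training",
    seed=42, image_size=IMG_SIZE, label_mode="categorical")
val_ds = tf.keras.utils.image_dataset_from_directory(
    "lung_image_sets", validation_split=0.3, subset="validation",
    seed=42, image_size=IMG_SIZE, label_mode="categorical")

# Step 3: load ResNet-50 pre-trained on ImageNet, without its top layer.
base = tf.keras.applications.ResNet50(
    weights="imagenet", include_top=False, input_shape=IMG_SIZE + (3,))

# Step 4: attach a 3-unit softmax head, one unit per class folder.
inputs = tf.keras.Input(shape=IMG_SIZE + (3,))
x = tf.keras.applications.resnet50.preprocess_input(inputs)
x = base(x)
x = layers.GlobalAveragePooling2D()(x)
outputs = layers.Dense(3, activation="softmax")(x)
model = tf.keras.Model(inputs, outputs)

# Step 5: categorical cross-entropy loss with the Adam optimiser.
model.compile(optimizer="adam", loss="categorical_crossentropy",
              metrics=["accuracy"])

# Step 6: execute (train) the model.
history = model.fit(train_ds, validation_data=val_ds, epochs=5)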
6 Experimental Results
The ResNet-50 model is trained, tested, and validated with the 70:15:15 ratio combination; the performance analysis of accuracy and loss is shown in Fig. 3.
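The curves of Fig. 3 can be reproduced from the Keras training history object; a short sketch, assuming history is the value returned by model.fit in the earlier sketch:

# Sketch: plotting training/validation accuracy and loss, as in Fig. 3.
# Assumes `history` was returned by model.fit(...) above.
import matplotlib.pyplot as plt

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))
ax1.plot(history.history["accuracy"], label="train")
ax1.plot(history.history["val_accuracy"], label="validation")
ax1.set_xlabel("epoch"); ax1.set_ylabel("accuracy"); ax1.legend()
ax2.plot(history.history["loss"], label="train")
ax2.plot(history.history["val_loss"], label="validation")
ax2.set_xlabel("epoch"); ax2.set_ylabel("loss"); ax2.legend()
plt.tight_layout()
plt.show()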
The data were tested with five different pre-trained CNN models: VGG-16, EfficientNetB0, EfficientNetB7, AlexNet, and ResNet-50. Accuracy is taken as the concluding performance parameter, as shown in Table 1; amongst the tested models, ResNet-50 outperformed the rest with a classification accuracy of 0.989, the highest achieved.
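The same pipeline can be repeated over several ImageNet-pretrained Keras backbones to populate a comparison like Table 1. A hedged sketch follows; AlexNet is not shipped with Keras applications and is omitted, each family's own preprocess_input step is left out for brevity, and train_ds and val_ds are assumed from the earlier sketch:

# Sketch: comparing pre-trained backbones, as in Table 1.
import tensorflow as tf

backbones = {
    "VGG-16": tf.keras.applications.VGG16,
    "EfficientNetB0": tf.keras.applications.EfficientNetB0,
    "EfficientNetB7": tf.keras.applications.EfficientNetB7,
    "ResNet-50": tf.keras.applications.ResNet50,
}

results = {}
for name, ctor in backbones.items():
    base = ctor(weights="imagenet", include_top=False,
                input_shape=(224, 224, 3))
    model = tf.keras.Sequential([
        base,
        tf.keras.layers.GlobalAveragePooling2D(),
        tf.keras.layers.Dense(3, activation="softmax"),
    ])
    model.compile(optimizer="adam", loss="categorical_crossentropy",
                  metrics=["accuracy"])
    model.fit(train_ds, validation_data=val_ds, epochs=5, verbose=0)
    _, results[name] = model.evaluate(val_ds, verbose=0)

print(results)  # validation accuracy per backbone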
Fig. 3 Performance analysis a accuracy, b loss
Table 1 Comparative discussion accuracy metric
Model VGG-16 EfficientNetB0 EfficientNetB7 AlexNet ResNet-50
Accuracy 0.9763 0.970 0.9656 0.948 0.989
Table 2 Comparative discussion ResNet-50
Class Precision Recall Support F1-score
Lung adenocarcinoma 0.926 0.951 0.948 0.932
Lung benign 0.909 0.921 0.935 0.941
Lung squamous cell carcinoma 0.951 0.950 0.942 0.938
The implementation is assessed over the three lung tissue classes, and classification performance comparisons are made with reference to the precision, recall, support, and F1-score parameters shown in Table 2.
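The per-class precision, recall, and F1-score of Table 2, together with the confusion matrix of Step 7, can be computed with scikit-learn; a sketch assuming model from the earlier code and a batched test_ds with one-hot labels prepared the same way as val_ds:

# Sketch of Steps 7-8: confusion matrix and per-class metrics.
import numpy as np
from sklearn.metrics import classification_report, confusion_matrix

y_true, y_pred = [], []
for images, labels in test_ds:
    probs = model.predict(images, verbose=0)
    y_true.extend(np.argmax(labels.numpy(), axis=1))
    y_pred.extend(np.argmax(probs, axis=1))

class_names = ["lung_aca", "lung_n", "lung_scc"]  # assumed folder order
print(confusion_matrix(y_true, y_pred))
print(classification_report(y_true, y_pred,
                            target_names=class_names, digits=3))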
7 Results Overview
The implementation performance analysis is presented in the above tables. Comparisons of the different deep neural network (CNN) models are shown in Table 1, where classification accuracy is considered. From Table 1, it is clear that ResNet-50 outperforms the other models trained with the same parameters, achieving an accuracy of 98.9% and correctly classifying tissue as non-cancerous or into its cancerous subtypes. From the plots in Fig. 3, we can infer a drastic increase in accuracy and a proportional decrease in loss up to the first epoch, followed by a gradual increase in accuracy and decrease in loss from the first to the fifth epoch.
8 Conclusion
This paper presents the implementation of digital whole slide image cancer detection and classification using the pre-trained deep CNN model ResNet-50. The performance of the CNN is superior, with high classification accuracy; from the results obtained, ResNet-50 outperforms the other models compared. As future scope, the algorithm could be combined with U-Net for segmentation, and the classification accuracy improved still further. A large number of features could also be extracted using the CNN, with dimensionality reduction performed using principal component analysis (PCA), and the code optimised for faster processing and improved accuracy with the help of graphics processing unit (GPU) computing architectures.
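As an illustration of the PCA direction, deep features could be taken from the pooled ResNet-50 backbone and reduced as sketched below; this is a rough sketch under the same assumptions as the earlier code (base and train_ds), not part of the reported implementation:

# Sketch: extracting CNN features and reducing them with PCA,
# as suggested in the future scope. Assumes `base` and `train_ds`
# from the earlier sketch.
import numpy as np
import tensorflow as tf
from sklearn.decomposition import PCA

# Backbone plus global pooling gives one 2048-dimensional vector per image.
extractor = tf.keras.Sequential([
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
])

features = []
for images, _ in train_ds.take(10):  # a few batches, for illustration
    batch = tf.keras.applications.resnet50.preprocess_input(images)
    features.append(extractor.predict(batch, verbose=0))
features = np.concatenate(features)

# Reduce the 2048-dimensional features to 50 principal components.
pca = PCA(n_components=50)
reduced = pca.fit_transform(features)
print(reduced.shape, pca.explained_variance_ratio_.sum())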
References
1. Aatresh AA, Yatgiri RP, Chanchal AK, Kumar A, Ravi A, Das D, Raghavendra BS, Lal S, Kini
J (2021) Efficient deep learning architecture with dimension-wise pyramid pooling for nuclei
segmentation of histopathology images. Comput Med Imaging Graph 93:101975
2. Zeng Z, Xie W, Zhang Y, Yao L (2019) RIC-Unet: An improved neural network based on Unet
for nuclei segmentation in histology images. IEEE Access 7:21420–21428
3. Chanchal AK, Kumar A, Lal S, Kini J (2021) Efficient and robust deep learning architecture
for segmentation of kidney and breast histopathology images. Comput Electric Eng 92:107177
4. Lal S, Das D, Alabhya K, Kanfade A, Kumar A, Kini J (2021) NucleiSegNet: robust deep
learning architecture for the nuclei segmentation of liver cancer histopathology images. Comput
Biol Med 128:104075
5. Al-Haija QA, Adebanjo A (2020) Breast cancer diagnosis in histopathological images using
ResNet-50 convolutional neural network. In: 2020 IEEE international IOT, electronics and
mechatronics conference (IEMTRONICS), IEEE, pp 1–7
6. Wang P, Hu X, Li Y, Liu Q, Zhu X (2016) Automatic cell nuclei segmentation and
classification of breast cancer histopathology images. Signal Process 122:1–13
7. Verma A, Panda A, Chanchal AK, Lal S, Raghavendra BS (2021) Automatic deep
learning framework for breast cancer detection and classification from H&E stained breast
histopathology images. In: Data science. Springer, Singapore, pp 215–227
8. Talo M (2019) Automated classification of histopathology images using transfer learning. Artif
Intell Med 101:101743
9. Feng Y, Zhang L, Yi Z (2018) Breast cancer cell nuclei classification in histopathology images
using deep neural networks. Int J Comput Assist Radiol Surg 13(2):179–191
10. Jiang Y, Chen L, Zhang H, Xiao X (2019) Breast cancer histopathological image clas-
sification using convolutional neural networks with small SE-ResNet module. PLoS ONE
14(3):e0214587
11. Krithiga R, Geetha P (2021) Breast cancer detection, segmentation and classification
on histopathology images analysis: a systematic review. Archives Comput Methods Eng
28(4):2607–2619
12. Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI (2015) Machine learning
applications in cancer prognosis and prediction. Comput Struct Biotechnol J 13:8–17
13. Toğaçar M, Özkurt KB, Ergen B, Cömert Z (2020) BreastNet: a novel convolutional neural
network model through histopathological images for the diagnosis of breast cancer. Phys A:
Stat Mech Appl 545:123592
14. Soulami KB, Kaabouch N, Saidi MN, Tamtaoui A (2021) Breast cancer: one-stage automated
detection, segmentation, and classification of digital mammograms using UNet model based-
semantic segmentation. Biomed Signal Process Control 66:102481
15. Chiu T-W, Tsai Y-L, Shun-Feng S (2021) Automatic detect lung node with deep learning in
segmentation and imbalance data labeling. Sci Rep 11(1):1–10
16. Sarwinda D, Paradisa RH, Bustamam A, Anggia P (2021) Deep learning in image classification
using residual network (ResNet) variants for detection of colorectal cancer. Procedia Comput
Sci 179:423–431
17. Rokad B, Nagarajan (2019) Skin cancer recognition using deep residual network. arXiv
preprint arXiv:1905.08610
18. Liang J (2020) Image classification based on RESNET. J Phys: Conf Series 1634(1):012110.
IOP Publishing
19. Yari Y, Nguyen TV, Nguyen HT (2020) Deep learning applied for histological diagnosis of
breast cancer. IEEE Access 8:162432–162448
20. Prakash V, Vas PS (2020) Survey on lung cancer detection techniques. In: 2020 international
conference on computational performance evaluation (ComPE), IEEE, pp 800–803
21. Lal S, Desouza R, Maneesh M, Kanfade A, Kumar A, Perayil G, Alabhya K, Chanchal AK,
Kini J (2020) A robust method for nuclei segmentation of H&E stained histopathology images.
In: 2020 7th international conference on signal processing and integrated networks (SPIN),
IEEE, pp 453–458
22. Dong H, Yang G, Liu F, Mo Y, Guo Y (2017) Automatic brain tumor detection and segmentation
using U-Net based fully convolutional networks. In: Annual conference on medical image
understanding and analysis. Springer, Cham, pp 506–517
23. Singh A, Sengupta S, Lakshminarayanan V (2020) Explainable deep learning models in medical
image analysis. J Imaging 6(6):52
24. Vesal S, Ravikumar N, Davari AA, Ellmann S, Maier A (2018) Classification of breast cancer
histology images using transfer learning. In: International conference image analysis and
recognition. Springer, Cham, pp 812–819
25. Ismail NS, Sovuthy C (2019) Breast cancer detection based on deep learning technique. In:
2019 international UNIMAS STEM 12th engineering conference (EnCon), IEEE, pp 89–92
26. Hekal AA, Elnakib A, Moustafa H-D (2021) Automated early breast cancer detection and
classification system. SIViP 15(7):1497–1505
27. Zheng Z, Zhang H, Li X, Liu S, Teng Y (2021) Resnet-based model for cancer detection.
In: 2021 IEEE international conference on consumer electronics and computer engineering
(ICCECE), IEEE, pp 325–328
28. Saber A, Sakr M, Abo-Seida OM, Keshk A, Chen H (2021) A novel deep-learning model for
automatic detection and classification of breast cancer using the transfer-learning technique.
IEEE Access 9:71194–71209
29. Ahmad HM, Ghuffar S, Khurshid K (2019) Classification of breast cancer histology images
using transfer learning. In: 2019 16th international bhurban conference on applied sciences
and technology (IBCAST). IEEE, pp 328–332
30. Sharma S, Mehra R (2020) Conventional machine learning and deep learning approach for
multi-classification of breast cancer histopathology images—a comparative insight. J Digit
Imaging 33(3):632–654
31. Lung and colon cancer histopathological images. https://www.kaggle.com/datasets/andrewmvd/lung-and-colon-cancer-histopathological-images. Accessed 30 May 2022