ArticlePDF Available

Information Leakage Detection and Risk Assessment of Intelligent Mobile Devices

June 2022
Mathematics 10(12):2011

June 2022
10(12):2011

DOI:10.3390/math10122011

License
CC BY 4.0

Authors:

(1) Background: Smart mobile devices provide conveniences to people’s life, work, and entertainment all the time. The basis of these conveniences is the data exchange across the entire cyberspace, and privacy data leakage has become the focus of attention. (2) Methods: First, we used the method of directed information flow to conduct an API test for all applications in the application market, then obtained the application data transmission. Second, by using tablet computers, smart phones, and bracelets as the research objects, and taking the scores of senior users on the selected indicators as the original data, we used the fusion information entropy and Markov chain algorithm skillfully to build a data leakage risk assessment mode to obtain the steady-state probability values of different risk categories of each device, and then obtained the entropy values of three devices. (3) Results: Tablet computers have the largest entropy in the risk of data leakage, followed by bracelets and mobile phones. (4) Conclusions: This paper compares the risk situation of each risk category of each device, and puts forward simple avoidance opinions, which might lay a theoretical foundation for subsequent research on privacy protection strategies, image steganography, and device security improvements.

The data leakage detection process of intelligent mobile devices.

…

The evaluation index relation diagram.

…

The risky application characterization table.

…

A typical high-risk API source code.

…

The proportion of privacy rights.

…

Figures - available via license: Creative Commons Attribution 4.0 International

Content may be subject to copyright.

Available via license: CC BY 4.0

Content may be subject to copyright.

Citation: Yang, X.; Liu, Y.; Xie, J.

Information Leakage Detection and

Risk Assessment of Intelligent Mobile

Devices. Mathematics 2022,10, 2011.

https://doi.org/10.3390/

math10122011

Academic Editor: Daniel-Ioan Curiac

Received: 6 May 2022

Accepted: 9 June 2022

Published: 10 June 2022

Publisher’s Note: MDPI stays neutral

with regard to jurisdictional claims in

published maps and institutional afﬁl-

iations.

Licensee MDPI, Basel, Switzerland.

This article is an open access article

distributed under the terms and

conditions of the Creative Commons

Attribution (CC BY) license (https://

creativecommons.org/licenses/by/

4.0/).

mathematics

Article

Information Leakage Detection and Risk Assessment of

Intelligent Mobile Devices

Xiaolei Yang , Yongshan Liu * and Jiabin Xie

School of Information Science and Engineering, Yanshan University, Qinhuangdao 066000, China;

yangxl@stumail.ysu.edu.cn (X.Y.); ean@stumail.ysu.edu.cn (J.X.)

*Correspondence: jsjbs0019@163.com

Abstract:

(1) Background: Smart mobile devices provide conveniences to people’s life, work, and

entertainment all the time. The basis of these conveniences is the data exchange across the entire

cyberspace, and privacy data leakage has become the focus of attention. (2) Methods: First, we

used the method of directed information ﬂow to conduct an API test for all applications in the

application market, then obtained the application data transmission. Second, by using tablet com-

puters, smart phones, and bracelets as the research objects, and taking the scores of senior users on

the selected indicators as the original data, we used the fusion information entropy and Markov

chain algorithm skillfully to build a data leakage risk assessment mode to obtain the steady-state

probability values of different risk categories of each device, and then obtained the entropy values

of three devices.

(3) Results:

Tablet computers have the largest entropy in the risk of data leakage,

followed by bracelets and mobile phones. (4) Conclusions: This paper compares the risk situation of

each risk category of each device, and puts forward simple avoidance opinions, which might lay a

theoretical foundation for subsequent research on privacy protection strategies, image steganography,

and device security improvements.

Keywords:

directed information ﬂow; information disclosure; information entropy; Markov;

risk assessment

MSC: 60J20; 94A17

1. Introduction

With the rapid development of science and technology, the electronic platform is

becoming more and more intelligent and mobile, which has brought great convenience to

people’s life. Today, with the prevalence of big data, the data itself are also spreading along

the trend of large depth, high production speed, wide dimensions, and low density. At

the same time, the means for hackers to steal information is also powerful, resulting in the

outﬂow of a large number of personal privacy data [

]. Information leakage has become

a hot topic in today’s cyberspace. How to detect, describe, and even protect privacy has

become the focus of the netizens’ close attention.

In 2018, the personal information of 87 million Facebook users was leaked. In Septem-

ber of the same year, the information of another 30 million users was leaked due to hacker

attacks, and the data of 68 million users were leaked due to software vulnerabilities on

14 December

. On 10 January 2019, Bob Diachenko, a hackenproof security researcher, found

that the detailed resume information of more than 202 million Chinese job seekers in the

mongodb database was published online, which was suspected to be leaked by third-party

applications. It is reported that the 202 million resumes stored in this database contain

202,730,434 records with very detailed information including the applicant’s name, height,

weight, address, date of birth, telephone number, email address, political orientation, skills,

work experience, salary expectation, marital status, driver’s license number, professional

Mathematics 2022,10, 2011. https://doi.org/10.3390/math10122011 https://www.mdpi.com/journal/mathematics

Mathematics 2022,10, 2011 2 of 13

experience, and career expectation, totaling 854 gb. In August 2020, a logistics company in

Hebei Province, China reported that its employee account was monitored by the company’s

logistics risk control system for the illegal inquiry of the waybill number information of

non-local outlets, resulting in the possible disclosure of a large number of the customers’

privacy information. On the evening of 15 March this year, the annual “15 March” party

was broadcast on the central ﬁnance and economics channel. The link of “improving digital

rules and building Internet economic conﬁdence” exposed the problem of personal privacy

leakage in enterprises: Zhilian recruitment failed to pass the examination of enterprises,

resulting in a large number of downloads of the resumes of job seekers. As a result, there

are many risks of private information leakage around us.

“Privacy computing theory” ﬁrst appeared in 1999. It pointed out that information

will be leaked only when device users think that the beneﬁts are equal to the risks [

]. Guo

Yu’s research showed that data information disclosure positively affected the privacy infor-

mation disclosure behavior, perceived mobile learning proﬁtability, and privacy control

while self-efﬁcacy positively affected the privacy information disclosure intention, and the

perceived mobile learning risk negatively affected the users’ privacy information disclosure

intention [

]. By studying the privacy information disclosure behavior and protection

of mobile device users, Xiong Jian showed that the factors of the perceived beneﬁts and

perceived risks had a strong impact on the users’ self-perceived willingness [

]. Wang Kan

used comprehensive fuzzy evaluation to evaluate the risk of data leakage in a transaction,

in which the risk factors included network access control, network application protocols,

ﬁrewalls, and identity authentication [

]. Zhao Zhuohe found that the wireless network

used by mobile devices was easy to intercept, resulting in important information and data

being stolen [

]. Li Yanhui believed that the wireless network is open and easy to obtain its

internal structure, so as to obtain important data nodes for targeted interception [

]. Xu

Jiale suggested that the social network or platform failed to strictly control the enterprise

qualiﬁcation, resulting in the platform’s inability to trace the source of information leak-

age [

]. Makhdoom believed that anonymous encryption could make greater efforts to

ensure that receipts were not disclosed [

]. To sum up, for smart mobile devices, the risk

of user information disclosure is distributed in all corners of cyberspace. Although there

are many studies on the risk of privacy disclosure, only a few can comprehensively and in

detail describe the risk factors of privacy disclosure and evaluate the risk of the information

disclosure of tablets, smartphones, and bracelets. Therefore, this paper subdivides and

expands the risk factor indicators considered in the above articles, and ﬁnally combined

them into ﬁve categories and 24 risk indicators to comprehensively evaluate the risk of the

privacy disclosure of tablets, smartphones, and bracelets.

First, based on the directed information ﬂow detection risk application, this paper

constructed an information ﬂow model to track and analyze the privacy points in real time.

Then, it summarizes the various risk factors of intelligent mobile devices in wireless net-

works, selects the risk indicators, and constructs an evaluation model based on information

entropy and Markov chain. Finally, according to the evaluation results, targeted preventive

measures will be issued and implemented.

2. Malicious Application Detection Based on Directed Information Flow

2.1. Basic Theory

Information ﬂow is a classic method to detect the information leakage of risky applica-

tions. This method was born in 1976 and is based on Denning’s grammatical information

ﬂow analysis:

FM =hN,P,SC,⊕,→i(1)

where Nis the set of some logical elements (code segments, variables, etc.) in the system;

Pis the collection of processes and the response subject of information ﬂow; SC is the

collection of safety levels, which is used to judge whether the operation behavior is legal;

⊕

is the operational supremum of the security level, and the result is the minimum common

Mathematics 2022,10, 2011 3 of 13

upper bound of security levels A and B. This indicates the ﬂow direction of the information

ﬂow, which means that the information in A is allowed to ﬂow to B [10,11].

The syntax information ﬂow detection steps are shown in Figure 1.

Mathematics2022,10,xFORPEERREVIEW3of14





istheoperationalsupremumofthesecuritylevel,andtheresultistheminimumcom‐

monupperboundofsecuritylevelsAandB.Thisindicatestheflowdirectionofthein‐

formationflow,whichmeansthattheinformationinAisallowedtoflowtoB[10,11].

ThesyntaxinformationflowdetectionstepsareshowninFigure1.



Figure1.Theflowchartoftheinformationflowdetection.

Inadditiontomaliciousapplications,privacyinformationleakagemayalsooccurin

variousstagesofbigdatacomputing.AsshowninFigure2,underthecloudplatform‐

basedbigdatacomputing,privacyleakagemayoccurduringthedatatransmissionfrom

theapplicationtothecloudserviceprovider,thecloudplatformcomputingprocess,and

thecloudplatformdataoutputphase.Therefore,wefocusedondetectingprivatedata,

andwhetherthisisdirectlytransmittedtotheexternalcyberspace,andifso,iftheappli‐

cationsoftwareisregardedassoftwarewiththeriskofprivacyleakage.



Figure2.Thecloudplatform‐basedbigdatacomputingenvironment.

Themethodcanroughlybedividedintothreesteps:first,abstracttheinformation

flow,analyzetheobjectsourcecode,andextracttheidiommeaningoftheinformation

Start

Abstract

information

flow

Generate

information

flow formula

Compliance with

safety agreement

End

potential safety

hazards

Handling

Figure 1. The ﬂow chart of the information ﬂow detection.

In addition to malicious applications, privacy information leakage may also occur in

various stages of big data computing. As shown in Figure 2, under the cloud platform-

based big data computing, privacy leakage may occur during the data transmission from

the application to the cloud service provider, the cloud platform computing process, and

the cloud platform data output phase. Therefore, we focused on detecting private data, and

whether this is directly transmitted to the external cyberspace, and if so, if the application

software is regarded as software with the risk of privacy leakage.