ArticlePDF Available

The Enhanced Forensic Examination and Analysis for Mobile Cloud Platform by Applying Data Mining Methods

October 2020
Webology Volume 18(Special Issue on Artificial Intelligence in Cloud Computing, 2021):47-74

October 2020
Volume 18(Special Issue on Artificial Intelligence in Cloud Computing, 2021):47-74

DOI:10.14704/WEB/V18SI01/WEB18006

Authors:

Ibrahim Alnajjar

Universiti Utara Malaysia

Massudi Mahmuddin

Universiti Utara Malaysia

Investigating the mobile cloud environment is a challenging task due to the characteristics of voluminous data, dispersion of data, virtualization, and diverse data. Recent research works focus on applying the latest forensic methodologies to the mobile cloud investigation. This paper proposes an enhanced forensic examination and analysis model for the mobile cloud environment that incorporates timeline analysis, hash filtering, data carving, and data transformation sub-phases to improve the performance of the cloud evidence identification and overall forensic decision-making. It analyzes the timeline of events and filters the case-specific files based on the hash values and metadata using the data mining methods. The proposed forensic model performs the in-place carving on the filtered data to guide the investigation and integrates the heterogeneous file types and distributed pieces of evidence with the assistance of the data mining. Finally, the proposed approach employs LSTM based model that significantly improves the forensic decision making.

LSTM-based Evidence Integration in the Cloud

…

nter-Linked Evidence Ratio Vs. Precision

…

Figures - uploaded by Ibrahim Alnajjar

Content may be subject to copyright.

Content uploaded by Ibrahim Alnajjar

Content may be subject to copyright.

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

47 http://www.webology.org

The Enhanced Forensic Examination and Analysis for Mobile Cloud

Platform by Applying Data Mining Methods

Ibrahim Ali Alnajjar

School of Computing, Universiti Utara Malaysia, Kedah, Malaysia.

Massudi Mahmuddin

School of Computing, Universiti Utara Malaysia, Kedah, Malaysia.

E-mail: ady@uum.edu.my

Received October 09, 2020; Accepted November 18, 2020

ISSN: 1735-188X

DOI: 10.14704/WEB/V18SI01/WEB18006

Abstract

Investigating the mobile cloud environment is a challenging task due to the characteristics of

voluminous data, dispersion of data, virtualization, and diverse data. Recent research works

focus on applying the latest forensic methodologies to the mobile cloud investigation. This

paper proposes an enhanced forensic examination and analysis model for the mobile cloud

environment that incorporates timeline analysis, hash filtering, data carving, and data

transformation sub-phases to improve the performance of the cloud evidence identification

and overall forensic decision-making. It analyzes the timeline of events and filters the

case-specific files based on the hash values and metadata using the data mining methods. The

proposed forensic model performs the in-place carving on the filtered data to guide the

investigation and integrates the heterogeneous file types and distributed pieces of evidence

with the assistance of the data mining. Finally, the proposed approach employs LSTM based

model that significantly improves the forensic decision making.

Keywords

Mobile Cloud Forensics, Examination and Analysis, Hash Filtering, Data Carving, Data

Transformation, and Data Mining.

Introduction

The dramatic development in Information Technology (IT) has introduced the Mobile

Cloud Computing (MCC) technology [1] that vividly changes the computing environment

from physical to virtual world. Cloud computing has become the dominant technology for

operating mobile applications. MCC enables mobile users to store the data and process the

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

48 http://www.webology.org

applications in the cloud, which mitigates constraints of the smartphones such as battery

life, memory capacity, processing delay, and computational power [2].

Smartphones offer greater capabilities to their users in the form of applications with the

support of internet access. In smartphones, most of the mobile applications such as

Google Mail and Facebook utilize cloud computing technology due to the advantage of

the distributed and scalable cloud environment. The rapid increase of cyber-attacks

significantly impacts the MCC that includes Denial-of-Service (DoS), Distributed

Denial-of-Service (DDoS), botnets, web-based attacks, physical-based attacks,

network-based attacks, or application-based attacks [3]. Nowadays, smartphones are

infected by viruses in different ways, such as through Short Messaging Service (SMS)

messages, instant messengers, internet Wireless Application Protocol (WAP) application

downloads, storage cards, and Bluetooth [4]. Hence, cyber criminals take advantage of the

advanced technology such as MCC to commit the crimes in the form of propagating a

terrorist ideology, facilitating terrorist communication, or disseminating attacks against

the software applications, programs, or digital information of the mobile user. Therefore,

conducting a forensic investigation on the mobile cloud environment becomes an

emergent need.

Cloud forensics is also in its infancy stage due to the lack of knowing the full impact of

the digital forensics community on the cloud model [5]. In addition, applying traditional

digital forensic tools and procedures for the investigation of a mobile cloud environment

is inappropriate for the volatile mobile environment and virtualization technologies and

remote storage-enabled cloud environment [6].

Hence, it is essential to understand the impact of cloud computing on smartphone

forensics and analyze whether the existing mobile forensics methodologies, tools, and

techniques support mobile cloud scenarios or not [7]. The phases of digital forensics

include identification, preservation, collection, examination, analysis, and presentation.

Due to the inter-relationship between the mobile and cloud during the execution of the

cloud-based mobile applications, the mobile cloud forensics involves the evidence

correlation in addition to the traditional forensic procedures in both the mobile and cloud

environments. In the context of mobile cloud forensics, the examination and analysis

phase plays a significant role in improving the performance of the investigation compared

to other investigation phases [8, 9]. It is because the major part of the evidence

identification in the cloud scenario heavily relies on the examination and analysis phase in

the smartphone, and crime event decision-making relies on the examination and analysis

phase in the cloud. Examining evidential artifacts from the mobile cloud services such as

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

49 http://www.webology.org

emails and communication applications provide the potential information to the forensic

investigator to reconstruct the crime event, which is beneficial for law enforcement to

prove the investigation. Thus, the proposed approach targets to enhance the examination

and analysis phase and its sub-phases in the mobile cloud environment with the assistance

of the data mining methods.

This paper model the mobile cloud forensic methodology, especially enhancing the

examination and analysis phase of the mobile and cloud environment. The primary

contributions are as follows.

• The proposed forensic examination and analysis model incorporates the hash filtering,

data carving, and data transformation sub-phases in both the mobile and cloud

environment to improve the investigation performance along with the help of data mining

methods.

• Instead of massively analyzing the information acquired from the collection phase, the

proposed forensic model performs the timeline analysis and hash filtering to filter the

acquired files in the large-scale collaborative environment effectively.

• By performing the in-place carving in the filtered artifacts, the proposed approach

extracts the potential features alone and validates the files based on the characteristics of

the file types.

• With the target of facilitating the forensic investigation among the heterogeneous type

of evidence in the distributed cloud storage, the proposed approach precisely integrates

the files and thus, builds the forensic evidence taxonomy for the corresponding crime

event.

• The proposed approach improves the investigation performance in the mobile cloud by

executing the proposed sub-phases and applying the data mining methods to deliver

accurate results.

Forensic Methodologies

National Institute of Standards and Technology (NIST) forensic procedure is widely

applied to mobile and cloud investigation. This section reviews the conventional research

works on mobile forensics, cloud forensics, and cloud-based mobile application forensics.

Mobile and Cloud Forensic Approaches

The forensics research [10] automatically monitors the malicious activities performed on

Android devices using several forensic components include a tractor beam for an Android

device, server, analysis framework, and central database. Even though it leverages the

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

50 http://www.webology.org

identification of the malicious android applications through ascertaining monitoring of the

android device, it fails to find more than one deleted applications due to the lack of

accessing the persistent storage location in the android. Droid Watch [11] is an

open-source and automated enterprise monitoring system for an Android device, which

continuously gathers, stores, and transfers the forensically-rich information of the Android

smartphone to a web server after obtaining the user consent without root privileges.

However, there is the possibility of a DoS attack before transferring the data to a Web

server. Mobile Forensic Investigation (MFI) life cycle model [12] resolves the

shortcomings in the traditional digital forensic investigation methods using new methods

and techniques in the lifecycle process. It involves data gathering, preservation, and report

generation with its sub-phases. Harmonized Digital Forensic Investigation Process

(HDFIP) [13] extracts the potential evidence of the mobile device in a forensically sound

manner, which ensures the flexibility, availability, integrity, adaptiveness, confidentiality,

accountability, and comprehensiveness during the evidence acquisition. Smartphone

forensic investigation process model (SPFIPM) [14] performs the smartphone

investigation using a fourteen-stage model for finding potential evidence. The research

work [15] presents the forensic data collection methodology for Android devices. Android

Forensic Data Analyzer (AFDA) [16] targets to reduce the workload for the investigator

by correlating the events of the same or different android applications to improve the

exposure of the hidden forensic data. It assists the investigator in effectively and quickly

analyze the forensic image. Android malware forensic analysis model [17] examines the

malware behaviors on the Android device and reconstructs the malicious events to detect

the suspicious programs quickly. The research work [18] conducts the forensic analysis

on two instant messenger applications that is cloud-based mobile applications such as

WhatsApp and Viber. It extracts the data pertaining to those two instant messengers from

its chat logs, images, chat history, and video files. WhatsApp forensics model [19]

extracts the forensic-rich evidence from the volatile and non-volatile memory of the

android device regarding the WhatsApp application activities and then analyzes the

extracted data in a forensically sound manner. The research work [20] assesses the data

related to the cloud storage applications stored on the client-side android and iOS devices.

In this case, such smartphones act as a proxy for the cloud storage data, which are utilized

for investigating the cloud services with fruitful benefits.

Examination and Analysis based Approaches

The research work [21] employs the Visualize Association Inside Emails (VAIE) system

to enable the forensic examiner to acquire the information regarding e-mails. With the

assistance of two layout models such as the spring force model and radial tree model, the

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

51 http://www.webology.org

forensic investigator visualizes the E-mail relevant data for better understand-ability.

Instead of analyzing the information about Apple iOS mobile devices such as contact

details, text messages, and voice messages, the research work [22] examines the evidential

artifacts in the third party application that comprises the potential information such as user

account, timestamps, geolocation, native files, and additional contact information on the

mobile device. Twitter, Facebook, Skype, iBooks, Four Square, Where.com, and Bright

Kite are several examples of cloud-based third-party applications. The earlier forensic

examination research works have focused on extracting the information regarding

username and filename alone of the cloud-based mobile applications such as Skype, Viber,

Dropbox, and Facebook. To improve the investigation performance, the research work [23]

forensically analyzes the social networking applications on different Smartphones and

recovers the events, which helps to extract the case-specific information. By forensically

analyzing the Skype application according to work [24], the forensic examiner recovers

the potential evidence from the RAM and NAND flash memories of the Android devices.

The forensic analysis model [25] extracts the evidence both the logical storage and

internal flash memory of WeChat application through MOBIL edit based logical

acquisition and Chip-off extraction based physical acquisition in iPhone respectively.

Even though it examines the WeChat application folders on the mobile device, it lacks to

investigate the application-relevant evidence in the cloud environment.

System Model

This section presents a system model for implementing the proposed forensic examination

and analysis methodology in the mobile cloud environment. Let the cloud-based mobile

application running on the Android device. During the mobile forensics, the investigator

accesses the partitions of the android device, including boot system, recovery, data, and

cache along with the Secure Digital (SD)card and SD-ext. If the forensic investigator

examines the crime event that is launched on the cloud-based mobile application, the

investigator acquires the log file entries, file system metadata structures, application logs,

registry information, chat logs, network packet, and so on. Similarly, the required

metadata information is acquired from the cloud server. These metadata parameters are

considered as the input to the forensic tools for the mobile cloud forensic examination and

analysis.

Let XfM denotes the forensic data belongs to the mobile device, and YfRef denotes the

forensic data belongs to the reference dataset in the National Software Reference Library

(NSRL). Wherein, „f‟, „M‟, and „Ref‟ denotes the features, mobile device, and reference

dataset, respectively. Moreover, consider the „P‟ number of forensic evidences acquired

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

52 http://www.webology.org

from the cloud environment, FE= {FE (1), FE (2),..., FE (P)} in which FE (i) is a

particular evidence, and FE (i) is its neighbor evidence, which is to be matched during

co-variance calculation. Each forensic evidence has „N‟ number of features, f= {1,2,...,N}.

For instance, each forensic record or data in a set of acquired data from the mobile devices

is denoted as XfM, and each forensic record or data in a set of good forensic data in the

reference dataset is referred to as YfRef. Let the forensic data have two features, such as IP

address and IMEI number. Hence, N = 2. For example, the acquired forensic data (XfM)

from the suspected mobile device have the IP address (Xf1M) of 131.175.17.9 and IMEI

number (Xf2M) of 990000862471238. The reference dataset comprises that sample record

(YfRef) is 131.175.17.9 as a legitimate IP address (Yf1Ref) and 990000862471854 as the

IMEI (Yf2Ref) of the genuine user. In the example above, the IP address and IMEI number

are the features of the forensic data or record.

It is essential to ensure that the acquired information is in the form of a structured dataset

such as Comma Separated Value (CSV) format to apply the data mining techniques on the

acquired forensic artifacts. It is because the proposed forensic model attempts to reduce

the burden of forensic investigation and apply the mining techniques. Hence, it transforms

the input forensic data into the required table format of rows and columns in which rows

represent the unique forensic data or record, and columns denote the fields, features, or

attributes. From the application database, the messages are modeled as AM with different

fields, including receiving message ID, sent message ID, sent and received time, media

caption name, file type, file size, and so on. Metadata assists in finding the associated

artifacts that are useful to verify the fraud, malicious insider, and other types of

cybercrimes in the mobile cloud. It describes the attributes of the applications or files in

the digital source of evidence, providing the logical consistency, accuracy, and coherence

of the applications or files. The evidence parameters are in different data types such as

string (S), integer (I), boolean (B), data and time (DT). If the source of evidence is log file

entries, the parameters involve message or event ID, timestamp, Internet Protocol (IP)

address, International Mobile Equipment Identity (IMEI), and user ID. Application logs

include the application name, version, and a timestamp. The parameters of the File system

metadata structures are file name, file size, file type, creation time, modification time,

access time, and file status, such as active or hidden. In the network packet, the

parameters of the source of evidence include packet length, source IP, and destination IP.

Figure 1 illustrates the components involved in the proposed mobile cloud forensic

examination and analysis.

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

53 http://www.webology.org

Figure 1 Forensic Examination and Analysis Model

An Improved Mobile Cloud Forensic Examination and Analysis Model

In forensic computing, examination and analysis are the essential phases in improving

investigation accuracy and efficiency. According to the NIST definition, „Examination‟ is

the process of identifying and extracting the relevant artifacts from the acquired data

while protecting the data integrity and „Analysis‟ is the process of analyzing the

examination results to derive the significant artifacts regarding the questions which

stimulate the collection and examination process.

Figure 2 shows the main phases of the mobile cloud forensics and designs the necessary

sub-phases for the examination and analysis phase in the mobile cloud environment. The

sub-phases of the forensic examination and analysis phase include timeline analysis, hash

filtering, data carving, data transformation, cross-referencing, and keyword searching. The

proposed approach focuses on improving the forensic investigation in the mobile cloud

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

54 http://www.webology.org

environment by enhancing the examination and analysis phase, which has depicted in

Figure 3.

Figure 2 The Phases of the Mobile Cloud Forensics with the Processes involved in the

Examination and Analysis Phase

Mobile Cloud Forensic Model

Timeline Analysis

Hash Filtering

Data Carving

Data transformation

Identification

Preservation

Collection

Examination and

Analysis

Presentation

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

55 http://www.webology.org

Figure 3 The Proposed Forensic Examination and Analysis Phase in Mobile Cloud

Timeline Analysis

In the context of mobile cloud forensics, timeline analysis plays an essential role in

filtering the potential evidence regarding the time of the crime event in both the mobile

device and the cloud without analyzing the evidential artifacts which are irrelevant to the

Mobile Cloud Forensic Examination and Analysis

Validating the

files based on

the file type‟s

characteristics

Extracting the

features using

PCA and

In-Place

Carving

Constructing

the

case-specific

evidence

taxonomy

Data Transformation

Data Carving

Analyzing

the timeline

of the events

Filtering files

based on

metadata and

hash values

Hash Filtering

Acquired Data

or files from

suspected

Smartphone or

Cloud

Classifying

the files

using deep

neural

network

Integrating the

files through

string matching

and LSTM

model

Cyber criminal detection for the

respective crime event with the

evidences

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

56 http://www.webology.org

crime event. In order to identify the creation, modification, and deletion time of the file

before and after the infection time, the forensic examination and analysis methodology

performs the timeline analysis to estimate the activity performed by the attacker after the

infection. During the timeline analysis, the four different types of times are considered as

the key factors, including the last modification time, the last access time, the last change

time of the Master File Table (MFT) entry, and the creation time. To obtain higher

investigation accuracy, the forensic examiners need to perform end-to-end temporal

analysis in terms of the timeline of events. Temporal characteristics have crucial

importance over different aspects such as the interaction of events with people, processes,

or objects.

In the cloud environment, forensically validating the logging framework tends to create

challenges in the timeline of events. The existing forensic investigation methods examine

only for the accessed timestamps, data remnants, and file contents, which consider the

logs are not necessary for the forensic investigation. However, the examination of log

files provides fruitful benefits to the investigator in connecting the dots during the

investigation. To effectively perform the timeline analysis in the mobile cloud

environment, the proposed examination and analysis methodology has focused on

modeling the knowledge representation. The knowledge representation model comprises a

large set of entities and relations between the events, which necessitates the subsequent

forensic analysis of other entities. It facilitates the reconstruction of the investigation

process to provide credibility to the investigated results. An automated timeline tool

forensically analyzes the web servers to determine the previously occurred website links

by applying the preprocessing on the raw web log files and path analysis of the URL

information. During the timeline analysis, the forensic investigator initially examines the

date and day concerning the occurred event.

Hash Filtering

The data reduction methods and hash sets address the data size constraint to handle an

ever-increasing amount of data during a forensic investigation. Hash filtering is one of the

time-saving techniques for the mobile cloud forensics examiner while dealing with

large-scale data. Consequently, it reduces the time on investigating the known suitable

files and facilitates the examiner in the mobile cloud environment within the scope of the

investigation. The proposed model generates the hash files for every file and validates the

generated hash files with a set of hashes of the previously calculated known useful files to

filter out the matched known and suitable hash files from the forensic database. The files

belong to the residual hash files in the forensic database are to be examined further by the

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

57 http://www.webology.org

forensic investigator. By applying the deep neural network classifier, the proposed model

recognizes the patterns in the data and classifies the relevant files that are to be used for

further investigation. In order to enhance the effectiveness of the forensic database that is

to be investigated, the proposed model focuses on filtering the files stored in the forensic

database based on the relevance to the crime event. Initially, it attempts to determine the

ignorable files from the acquired data based on the hash data that reflects the

corresponding software. Moreover, it examines the most recent hash values from the hash

database to retain only the potential files for further investigation.

Figure 4 Steps involved in the Hash Filtering Process

In the field of mobile forensics, the current methods have filtered the irrelevant files

through reporting the consistent hash files and validating the reported hash data of

individual files from its back-to-back acquisitions and subsequent acquisitions,

respectively. In the evidence database, the images contain metadata or Exif information

Learning the features of the

known good files

Correlating the features of

the files

Hash Filtering in Examination and Analysis

Smartphone

Cloud

Extracted Metadata

Generated Hash Values

Decision-making

Classifying the files using

Neural network

Matching the Metadata and Hash values

of the files

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

58 http://www.webology.org

involving the timestamp and several additional attributes that assist in determining the

source of the image during the investigation. Metadata is the information about the files

that are acquired from the mobile device and the cloud, includes file size, file name, last

accessed time, and so on.

(1)

Equation (1) computes the correlation score between the data residing in the mobile

device (XfM) and the reference dataset (YfRef). In Equation (1), „f‟ refers to the features

that vary from „1‟ to „N‟, which are based on three main categories such as file metadata,

case metadata, and file content-based data. The features include file type, IP address, file

name, permissions, file ID, email ID, file content type, URLs, and so on. If the correlation

score is high when correlating with the known useful features, the proposed methodology

ignores that information from the forensic evidence database during the investigation.

In the field of cloud forensics, segregating the relevant evidence from the acquired

evidence is a challenging process due to the cause of the encrypted data. Even though live

forensics enhance the forensic capabilities through frequently capturing the images of the

running environment, it leads to overhead, cost, and performance issues. Hence, securely

maintaining the hash values of the forensic artifacts is essential such as secure log files.

Subsequently, the forensic investigator needs to create standard forensic procedures to

have access to the decryption key without privacy violations.

(2)

Equation (2) predicts the class of the input data while employing the neural network as a

classifier for the forensic good and bad files classification. In Equation (2), „wi‟ represents

the weight that shows the relative influence of the input „xi‟, and „b‟ denotes the bias in

the classification results based on the neurons. By training the neural network classifier

with the known good files characteristics or patterns, the proposed methodology filters out

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

59 http://www.webology.org

the good information from the acquired evidential artifacts. Initially, the forensic

investigator identifies several parameters such as timestamp, IP address, and Media

Access Control (MAC) address for forensic analysis to retain the evidence of malicious

activity alone. Moreover, search activities of frequent users in terms of verifying their IP

addresses, byte transferred, and file access. By analyzing the behavior of the user, the

proposed model confirms the IP address of the malicious user.

Data Carving

In the field of cyber forensics, the term „file or data carving‟ has been used, which extracts

the data from the raw data in terms of evidential artifacts through identifying and

recovering the relevant data based on the forensic analysis. Data or file carving plays a

crucial in forensically determining the deleted or hidden files from the digital media

stored in both the mobile and the cloud.

In the smartphone and the cloud, the hidden areas of a file include slack space, lost

clusters, and unallocated clusters of the digital media or disk. In order to apply this type of

extraction method, there is a need for a standard file signature with the file header and

footer, which enforces the search to extract and analyze the file during the file validation.

The existing forensic data carving tools search the header signature of each file in the

digital media even when there is the existence of the file system for the corresponding

media. In this context, to mitigate the search space, it is adequate to search the header

signatures in the slack space, lost clusters, and unallocated clusters on the disk.

Also, to minimize the time of searching the header signature, the data carving methods

have focused on searching the header signatures within the few bytes of the cluster and

sector, and throughout the sectors. In the data carving method, file structure-based carving

identifies the information level in a file format to recognize the file while matching the

extracted information on the file format with the raw dataset. The content-based file

carving method explores the content of the clusters to identify the relationships between

the files regarding a specific file, which assists to reassemble the original file recovery.

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

60 http://www.webology.org

Figure 5 Steps involved in the Data Carving Process

Figure 5 shows the proposed data carving process for the mobile cloud forensics. The

proposed examination and analysis model applies the file carving in the hidden areas

when there is an availability of the file system. Moreover, it performs the file carving

from the raw image even when there is no file system. In consequence, the proposed

model employs a disk analysis tool to provide the in-place or zero storage carving in the

slack space, lost clusters, and unallocated clusters. In essence, the proposed forensic

methodology incorporates several key components to perform the file carving process

with the time efficiency in the mobile cloud environment. After filtering the irrelevant

data from the forensic database, the proposed model attempts to provide the dynamically

updated file system to the investigator to mitigate their waiting time when there is

unavailable of the file system. In subsequence, it applies the signature, file structure, and

content-based carving method to extract the critical data from the hidden areas of the

digital media. In the proposed methodology, data carving involves the signature-based

carving, file structure based carving, and content-based carving, which extracts the hidden

information from the reduced set that acquired evidential artifacts. To extract the data

Data Carving in Examination and Analysis

Cloud

Smartphone

Reduced evidential

artifacts

Reduced evidential

artifacts

Signature, File structure and

Content based carving

Dynamic updation of

Android File system

Dynamic updation of

Remote File system

In-Place Carving

Feature extraction and

In-Place Carving

File Validation

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

61 http://www.webology.org

hidden in the files, the forensic investigator needs to understand the signature, structure,

and content of the files due to the availability of the different data hiding process. Figure 6

illustrates the detailed procedure of the data carving process in the proposed mobile cloud

forensic examination and analysis methodology.

• File Signature Based Carving: With the help of file signatures, the proposed data

carving method identifies the file types from the unallocated clusters. The file signature is

referred to as a text value, magic number, or numerical value, which assists the

identification of file format. File signature-based data carving plays a significant role in

defending against the data hiding techniques, which facilitates the forensic investigator in

the massive mobile cloud environment. File signature analysis is based on the process of

matching the files, headers, and extensions with the existing databases to determine the

hidden files. In essence, the forensic investigator focuses on the extraction of the

information about the type of the file from the header or footer fields in a file, termed as

file signature. The forensic investigator often matches the file signature with the file

extensions to identify similar files, in some cases, there are few exceptions, such as

mismatches, unknown types, no matches, and anomalous results while matching the files.

File signatures are also known as the magic numbers, which have unique values with the

replacement of named constants. The file signatures are either in the hexadecimal format

or the ISO 8859-1 encoding format for different file formats. For example, the file format

of Adobe PDF and JPEG image has the file signature as the 25 50 44 46 and FF D8 FF

respectively, which are in the hexadecimal format.

• File Structure-Based Carving: In order to determine the fragmented files, extracting

the internal structure of the file is essential to facilitate the forensic investigator in

recovering a file and identifying the starting and ending point of the data. The forensic

technique of file carving focuses on recovering the files based only on the file structure

and content without matching the metadata of the file system. It often recovers the files

from the unallocated space in the mobile and cloud storage in which unallocated space

indicates that the area of holding no files as a long time. Moreover, it recovers the files

from entire storage when there is the existence of the missing or damaged file structures

in both the mobile and cloud environments. By utilizing the file system structure, the

forensic investigator quickly identifies and extracts the undeleted data from the

unallocated space. For example, consider a bitmap file comprises the file size in bytes in

the footer, JPEG file contains the metadata sequence, and word file comprises the byte

strings such as keywords, author, and company. By accessing the file system structure, the

forensic investigator can easily recover the deleted file from the presence of the file entry

and information linking with the clusters.

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

62 http://www.webology.org

• File Content-Based Carving: File content based carving relies on the content

structure such as XML and HTML and content characteristics such as statistical attributes,

character count, text, or language recognition. File content carving is also known as the

semantic carving. The file content is mostly encoded in different ways while storing the

data in the file for different applications. Hence, the associated applications only access

the stored file in terms of reading and extracting the content of the file. Instead of deleting

the malicious files or evidence, the malicious individuals or criminals often hide the

information inside another file by changing its extensions to misguide the investigator.

For example, criminals hide their sensitive information by modifying the file extension,

such as changing the.doc file to.jpg file.

During the mobile forensics, the proposed methodology focuses on the dynamic updation

of the file system, in-place carving, signature, file structure and content-based carving,

and file validation. Whereas, in the context of the cloud forensics, to deal with the

massive amount of data storage, the proposed methodology employs the feature extraction

techniques such as Principle Component Analysis (PCA) while analyzing the content of

the files along with the in-place carving. In both the mobile and cloud, in-place carving

enables the investigator to examine the files without copying the content of the files,and it

stores the metadata of the files in the forensic database for further forensic examination.

The in-place carving process outcomes the details include the file name, starting position

of the file in the device or cloud, file length, location of the file, and truncated details. By

applying PCA, the proposed forensic examination and analysis methodology selects a set

of features based on the principle components of the massive cloud Forensic Evidence

(FE). To identify the principle components, the proposed methodology computes the

covariance matrix based on the training samples and calculates the eigenvectors and

eigenvalues. The feature extraction or feature selection method forms a subset of the

features of the raw features, which retains only the potential contents of the overall data.

For instance, to identify the file types such as „doc‟, „gif‟, „pdf‟, „jpg‟, and „html‟, the

proposed model analyzes the features in terms of the frequency of occurring each byte

value as well as the content of the file. It assumes that the same file types have the same

characteristics even for the different files, which assists in detecting the file type related to

the suspected activity. In order to reduce the false positives and speed up the carving time,

the proposed methodology performs the file validation based on the file type validator

during carving operations. The file validation method conducts the matching process for

the retained details of the carving operations in the mobile cloud environment. The file

validation of the proposed methodology matches the carved details of the files on the

mobile device with the carved details of the files in the cloud to determine the inherently

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

63 http://www.webology.org

correlated files, and fine-tune the evidential artifacts towards the potential evidence for the

corresponding crime event.

Figure 6 Data Carving Process in Mobile Cloud Forensic Examination and Analysis

Methodology

Data Transformation

In the mobile cloud environment, data transformation or integration plays a crucial role

mainly to facilitate the hidden patterns and potential information from a large amount of

cloud evidential artifacts compared to the mobile evidence. Data integration is the process

of combining massive data from the disparate sources and transforms it into a unified

structure. In the context of mobile cloud forensics, data integration combines the data

Data Carving in Mobile Cloud Forensic Examination and Analysis

Evidential Artifacts in Mobile and Cloud

Android File

System

Cloud Remote

File System

Android and

Remote File

System

Dynamic Updation

Carving Operations

Matching analyzed

details with

suspected artifacts

Hexadecimal

signature of the

file analysis

Header or

Footer of the

file analysis

Content structure

and Characteristics

Analysis

Retaining potential features in the Cloud

In-place carving based file filtering

Validating files filtered in the mobile artifacts

with Cloud artifacts

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

64 http://www.webology.org

acquired from the various data centers and transforms the heterogeneous types of forensic

evidence into a single format. Consequently, it assists in guiding the investigator to make

the right decisions at the right time during the investigation. Even though traditional data

mining techniques efficiently perform the integration of large volumes of unstructured,

semi-structured, or streaming data, it lacks to integrate heterogeneous evidence from

different dynamic locations. Most of the existing security systems examine the footprints

of the intruders from the log files of the system to identify the potential evidence from the

massively acquired evidence regarding the suspicious activities. Even though current

forensic systems support the investigation with the trace of network traffic, account

management, file system checkers, system monitoring, and system log files, there is a lack

of providing adequate pieces of evidence for the event reconstruction due to the

independent logging.

Figure 7 Steps involved in the Data Transformation Process

Smartphone

Cloud

Data Transformation in Examination and Analysis

Distributed and

Heterogeneous type of

evidences

Heterogeneous type of

evidences

Integrating the files from

virtual clouds using LSTM

based forensic logging

Integrating the file types

through string matching

Analyzing metadata of the

evidences

Building Evidence Taxonomy

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

65 http://www.webology.org

Integrating the Forensic Logs using Long-Short Term Memory (LSTM) Model

Figure 7 illustrates the proposed components incorporated with the data transformation in

the mobile cloud forensic examination and analysis phase. In the field of mobile cloud

forensics, the proposed forensic examination and analysis model handles the

heterogeneous types of evidence acquired from the smartphones and cloud concerning the

cloud-based mobile application execution.

If the forensic collection phase acquires the system call and its accompanied activities of

the users, the investigator easily reproduces the security breaches or crime events that

happened on the smartphone. The acquisition of system calls enables the investigator to

accomplish the time efficiency and completeness of logging data through only recording

the relevant information. Owing to the existence of the system call logging module in the

suspected device, the forensic investigator identifies the modification of logging

information from the recorded activities in the backend storage system.

The logging system collects the information about all the activities of the cloud-based

mobile application performed by the users and securely stores the information in a

forensic server to facilitate the forensic investigation. Thus, in essence, the proposed

examination and analysis phase needs to analyze both the volatile and non-volatile

information in the aspect of the users and processes, chronological order, and activities of

the users. Moreover, with the help of LSTM, the proposed data transformation model

integrates the forensic evidence into a unified format from the multiple modalities.

During mobile forensics, the proposed forensic methodology handles the heterogeneous

types of the evidence acquired from the activities of the cloud-based mobile application of

the suspect‟s device and integrates such files with the help of string matching regarding

the directory, file name, and metadata.

In cloud forensics, the proposed data transformation model employs the LSTM

architecture, which is an extension of the Recurrent Neural Network (RNN). The LSTM

architecture is more suitable for the forensic data integration since it contextually

memorizes and recalls the forensic images even when the forensic database comprises the

snapshots as the raw pixel data of the image in terms of high dimensional inputs. Hence,

the proposed forensic methodology combines the LSTM architecture with the multimodal

autoencoder to unify the information of various modalities, which assists in realizing the

temporal sequence of the multimodal representation for a particular event. In the context

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

66 http://www.webology.org

of forensics, the proposed approach considers the heterogeneous type of evidence, such as

text log files and snapshot images, as the learning modalities.

Initially, the proposed forensic model extracts a set of feature vectors for each modality

using the encoder LSTM and then forms the integrated feature vector of all the modalities

with the consideration of temporal information. In subsequence, it employs the

multimodal autoencoder to reconstruct the feature vectors for each modality from the

integrated feature vector and then exploits the decoder LSTM to decode the input in

chronological order from the reconstructed feature vector.

Thus, it facilitates the forensic investigator through heterogeneous data types integration

acquired from the distributed data centers regarding a particular cloud-based mobile

application execution. In the cloud environment, the forensic data transformation is

relatively similar to the forensic logging supporting four services, such as completeness,

authenticity, reproducibility, and efficiency.

An LSTM network [26], input gate, forget gate, and cell state are the essential elements.

In order to support the forensic logs integration with the knowledge of the crime event

and evidence nature, the proposed methodology attempts to model the LSTM with an

additional control cell that manipulates the forensic features during the learning process.

As a result, the LSTM based forensic methodology decides that the information be

retained for further investigation in an integrated form. It categorizes the forensic

evidential artifacts based on the relevancy in terms of activities within a particular

timestamp or region.

(3)

In Equation (3), Wf event and Wsfl are the weight information related to the forensic

event and layer-wise similar features, respectively. Wt is the input at time step „t‟, and ɑl

is the layer-wise constant. Ft is the forensic gate that is used to learn certain patterns

regarding the crime event and forensic characteristics, which is used to update control

vector in the LSTM network, i.e. dt=Ft ⊙ dt-1.

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

67 http://www.webology.org

Figure 8 LSTM-based Evidence Integration in the Cloud

Figure 8 depicts the proposed evidence integration process using LSTM in the massive

and distributed evidence storage environment of the cloud. The proposed forensic

integration and linking among the extracted evidence reveal that the relationships between

the same activities and also existing relationships between the targeted victim and the

evidence. According to the committed crime in the mobile cloud environment, the

proposed forensic examination and analysis methodology captures the possible relevant

pieces of evidence that are linked to either attacker or targeted suspect. To reduce the

complexity of linking or grouping the evidence related to the same activity or person from

the distributed storage, the proposed model employs the LSTM for data integration. The

LSTM-based classification model automatically handles this process to reduce manual

errors and mitigate time. It finds the relationship between the previously obtained

evidence. The primary advantage of the long-term memory in the LSTM enforces the

integration of the evidence through its processing of sequence chains. Wherein gates

decide which information is relevant to the sequence of evidence to make decisions on

Distributed and Heterogeneous Types of

Evidences in the Cloud

LSTM based Evidence Integration

Learning a

specific crime

event and

forensic

features

Organizing the

similar or

integrated

evidences in

chronological

order

Time and

features based

Evidence

Classification

Feature Vectors

Extraction for

all modalities of

the evidences

Analyzing unified features of a set of integrated evidences

from the header/footer

Building Evidence Taxonomy based on the unified features

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

68 http://www.webology.org

either keeping or forgetting the information during training. In LSTM, sigmoid activation

squishes the values between 0 and 1 to update or forget the incoming data. The forget gate

decision heavily relies on the outcomes of the sigmoid function. By processing the

information from the previous hidden state and current state, the sigmoid function

outcome values between 0 and 1. If the value is closer to 0, the LSTM forgets the

information; otherwise, it keeps that information. The input gate and output gate are

responsible for deciding what information is relevant to add from the current state and

decides what information is to be in the next hidden state respectively. Accordingly, at the

end of the LSTM, the proposed methodology obtains a set of grouped evidence by

applying Equation (3), which facilitates the forensic investigation rather than analyzing

the evidence of a particular crime event in a distributed manner.

In the mobile cloud environment, the forensic investigator combines all the forensic logs

or evidence into a forensic repository to analyze the acquired files for the corresponding

malicious activity performed in the cloud-based mobile application. The log machines or

forensic server stores all the log files acquired from both the mobile and cloud

environments. In essence, the forensic server contains the forensic data repository. The

integration of several mobile cloud forensic logs includes system_log, secure_log,

packet_log, and volatile log. Finally, to ease the forensic investigation process over the

abundant collection of the cloud forensic evidence, the proposed forensic examination and

analysis methodology builds the case-specific forensic evidence taxonomy for the

corresponding evidence. In order to model the massive cloud evidence in the form of

machine-understandable format, annotation of the evidence with metadata has performed

by the proposed forensic methodology. The evidence management process annotates the

evidence with the semantic information in a logical manner. The tagged or annotated

information assists the investigator to analyze and report the evidence in the cloud easily.

In the proposed case-specific evidence taxonomy, the metadata provides the rudimentary

details about the corresponding evidence such as folders and files, which comprises the

concepts and properties to model the taxonomy with richer description. The proposed

forensic methodology improves the forensic decision-making by finding the criminal for

the corresponding crime event performed in the cloud-based mobile applications. It

quickly responds to the investigation query by retaining and analyzing only the

forensically relevant information such as case-specific data in the mobile cloud

environment with the support of the data mining techniques. Finally, the mobile cloud

forensic framework iteratively correlates the evidence that is analyzed from both the

mobile and cloud and provides significant results under the forensically sound conditions.

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

69 http://www.webology.org

Experimental Evaluation

This section describes the prototype for evaluating the LSTM-based data transformation

in the proposed forensic examination and analysis methodology. The experimental model

evaluates the part of the enhanced forensic methodology with a set of logs collected from

the Open Nebula cloud environment.

Experimental Setup

The experimental model employs Open Nebula to collect cloud activity logs regarding the

execution of mobile cloud applications. Open Nebula is a widely used open-source toolkit

for Infrastructure as a Service (IaaS) cloud computing, which is a virtualization tool

enabling the computations in a private cloud or public cloud. It comprises three logging

systems, such as syslog logging, logging to standard error system, and file-based logging

systems. The proposed forensic model utilizes the log files extracted from „/var/log/one‟

in Open Nebula, which is a file-based logging system. The investigator acquires the

evidence from the multiple log files such as oned.log, sched.log, sunstone.log, and

VMID.log. In essence, Open Nebula stores the logs for the activities performed in the

Virtual Machine as the files with the VM ID similarly, the multiple log files exist in Open

Nebula cloud. For example, if a suspect launches the malicious activity during application

execution in VM1, the forensic investigator needs to collect the VM1.log file. Moreover,

Open Nebula supports features such as multi-tenant computing, data center federation,

and virtual data centers on top of vCenter. Multiple OpenNebula data centers or zones are

in the form of federation, sharing the information related to the same user accounts,

permissions, and groups across the virtual data centers. As a result, the forensic

investigator acquires the evidence of a particular suspect or event from the different

OpenNebula virtual data centers.

Details for LSTM Implementation

This section demonstrates the final process of the proposed examination and analysis

methodology in terms of implementing the LSTM-based data transformation process for

the cloud evidence using the Java programming language. The experimental model

assumes that the synthetic input dataset of LSTM is the outcome of several proposed

examination and analysis processes such as timeline analysis, hash filtering, and data

carving. After partially fine-tuning the acquired evidence, the experimental model

conducts the experiments on the cloud evidence using LSTM according to the procedures

explained in the data transformation process. The synthetic dataset of the cloud evidence

consists of several fields such as crime event timestamp, crime type such as identity theft,

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

70 http://www.webology.org

stalking, pornography, hacking, software piracy, and illegal electronic surveillance,

user_ID, application_ID, event_ID, datacenter or zone_ID, server_ID, log_ID, and

timestamp. Also, it comprises several footprints such as file creation time (C Time), file

altered time (A Time), file read time (R Time), Master File Table (MFT) changed time (M

Time), file permissions, including read-only, archive, offline, hidden, temporary,

encrypted, and compressed, file size, file type, access control type such as access allowed,

denied, and audit, file name namespace, and file name length. The experimental model

applies the LSTM algorithm on such a generated synthetic dataset for the distributed

evidence linking based on the sequence of inputs over time. The experimental model

splits the generated dataset into a training set and testing set in the ratio of 70:30. From

the entire forensic evidence in the generated dataset, 70 percent of the dataset is applied

for training the LSTM and the remaining 30 percent for testing. The experimental model

employs the crime type field as the target variable which is selected from the training set

to integrate or inter-link the forensic evidence from the perspective of distributed evidence

and heterogeneous file types of a similar crime event. As a result, the proposed model

delivers relevant evidence for the corresponding crime event with the assistance of the

LSTM and forensic investigator. According to the federal rules of evidence [27], relevant

evidence is defined as “any tendency to make the existence of any fact that is of

consequence to the determination of the action more probable or less probable than it

would be without the evidence”.

Performance Metrics

Precision: It is the ratio between the number of accurately classified or identified files as

the relevant evidence and the total number of files classified as the evidence by the

system.

Recall: It is the ratio between the number of accurately classified or identified files or

evidence as the potential evidence and the total number of files or evidence that are

relevant to the incident.

Experimental Results

The experimental results illustrate the performance of the Forensic Evidence Integration

using LSTM (FEI-LSTM) for the cloud evidential artifacts with the help of the precision

and recall performance metrics.

Inter-Linked Evidence Ratio Vs. Precision

Figure 9 illustrates the precision of the FEI-LSTM for the increased amount of

inter-linked evidence ratio. Inter-Linked Evidence Ratio (ILER) is the ratio between the

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

71 http://www.webology.org

number of evidence that is related to the evidence in different datacenters and the total

number of evidence acquired for a particular crime event. Initially, the precision value

increases from 86% to 87.5% for the variation of Inter-Linked Evidence Ratio from 0.2 to

0.6.

After reaching the ILER from 0.6, the proposed FEI-LSTM model maintains the precision

value in the average of 87.6% by considering the inherent forensic evidence footprints

during the integration and classification of the evidence using the LSTM model. By

performing the integration of the distributed evidence, the proposed forensic examination

and analysis methodology assists the forensic investigator accurately determine a set of

relevant evidence regarding a crime event.

Figure 9 Inter-Linked Evidence Ratio Vs. Precision

Inter-Linked Evidence Ratio Vs. Recall

Figure 10 Inter-Linked Evidence Ratio Vs. Recall

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

72 http://www.webology.org

The performance of the recall or sensitivity or true positive rate is depicted in Figure 10

while varying the Inter-Linked Evidence Ratio for the LSTM based integration and

classification model. The FEI-LSTM model assists the forensic investigator to precisely

filter the evidence that is relevant to the crime event based on the potential footprints of

the criminal activity, primarily, crime type and the crime occurred timestamp. As a result,

the recall value gradually increases with the increase of the Inter-Linked Evidence Ratio

from 0.2 to 1.0. The FEI-LSTM model yields the average recall to 86%, even when

increasing the number of distributed evidence and multi-modal similar evidence.

Moreover, the temporal sequence-based deep learning in the proposed forensic

examination and analysis methodology ensures that the balanced recall value of the

massive collection of heterogeneous file types inthe same crime event and distributed

evidence in the cloud.

Conclusion

This paper presented an enhanced mobile cloud forensic examination and analysis model

with the incorporation of the essential sub-phases along with the adoption of the data

mining techniques. The proposed forensic examination and analysis model has improved

the sequential process of the forensic investigation on both the mobile and cloud

environments. It has ensured that the improved performance of the forensic

decision-making by introducing and enhancing the sub-phases of the forensic examination

and analysis phase in the mobile cloud environment. By modeling the data

mining-assisted enhanced examination and analysis phase, the proposed methodology

ensures the improved performance of the investigation in the mobile cloud. The

experimental results show that the performance of the LSTM based evidence integration

and relevant evidence identification process through precision and recall.

References

Fernando, N., Loke, S.W., & Rahayu, W. (2013). Mobile cloud computing: A

survey. Future generation computer systems, 29(1), 84-106.

Qureshi, S.S., Ahmad, T., & Rafique, K. (2011). Mobile cloud computing as future for

mobile applications-Implementation methods and challenging issues. In IEEE

International Conference on Cloud Computing and Intelligence Systems,

467-471.

Shahzad, A., & Hussain, M. (2013). Security issues and challenges of mobile cloud

computing. International Journal of Grid and Distributed Computing, 6(6),

37-50.

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

73 http://www.webology.org

Wang, Y., Streff, K., & Raman, S. (2012). Smartphone security challenges. Computer

45(12), 52-58.

Damshenas, M., Dehghantanha, A., Mahmoud, R., & Bin Shamsuddin, S. (2012).

Forensics investigation challenges in cloud computing environments. In IEEE

Proceedings Title: International Conference on Cyber Security, Cyber Warfare

and Digital Forensic (CyberSec), 190-194.

Faheem, M., Kechadi, T., & Le-Khac, N.A. (2015). The state of the art forensic

techniques in mobile cloud environment: A survey, challenges and current

trends. International Journal of Digital Crime and Forensics (IJDCF), 7(2),

1-19.

Zhu, M. (2011). Mobile cloud computing: implications to smartphone forensic

procedures and methodologies (Doctoral dissertation, Auckland University of

Technology).

Thomas, P., Owen, P., & McPhee, D. (2010). An analysis of the digital forensic

examination of mobile phones. In IEEE Fourth International Conference on Next

Generation Mobile Applications, Services and Technologies, 25-29.

Roussev, V., & McCulley, S. (2016). Forensic analysis of cloud-native

artifacts. Digital Investigation, 16, S104-S113.

Guido, M., Ondricek, J., Grover, J., Wilburn, D., Nguyen, T., & Hunt, A. (2013).

Automated identification of installed malicious Android applications. Digital

Investigation, 10, S96-S104.

Grover, J. (2013). Android forensics: Automated data collection and reporting from a

mobile device. Digital Investigation, 10, S12-S20.

Rajendran, S., & Gopalan, N.P. (2016). Mobile Forensic Investigation (MFI) life cycle

process for digital data discovery (DDD). In Proceedings of the International

Conference on Soft Computing Systems, New Delhi. Springer, 393-403.

Mumba, E.R., & Venter, H.S. (2014). Mobile forensics using the harmonised digital

forensic investigation process. In IEEE Information Security for South Africa,

1-10.

Goel, A., Tyagi, A., & Agarwal, A. (2012). Smartphone forensic investigation process

model. International Journal of Computer Science & Security (IJCSS), 6(5),

322-341.

Vidas, T., Zhang, C., & Christin, N. (2011). Toward a general collection methodology

for Android devices. Digital investigation, 8, S14-S24.

Kasiaras, D., Zafeiropoulos, T., Clarke, N., & Kambourakis, G. (2014). Android

forensics: Correlation analysis. In IEEE 9th International Conference for Internet

Technology and Secured Transactions (ICITST-2014), 157-162.

Webology, Volume 18, Special Issue on Artificial Intelligence in Cloud Computing

January, 2021

74 http://www.webology.org

Li, J., Gu, D., & Luo, Y. (2012). Android malware forensics: Reconstruction of

malicious events. In IEEE 32nd International Conference on Distributed

Computing Systems Workshops, 552-558.

Mahajan, A., Dahiya, M.S., & Sanghvi, H.P. (2013). Forensic analysis of instant

messenger applications on android devices. arXiv preprint arXiv:1304.4915.

Anglano, C. (2014). Forensic analysis of WhatsApp Messenger on Android

smartphones. Digital Investigation, 11(3), 201-213.

Grispos, G., Glisson, W.B., & Storer, T. (2013). Using smartphones as a proxy for

forensic evidence contained in cloud storage services. In IEEE 46th Hawaii

International Conference on System Sciences, 4910-4919.

Meng, F., Wu, S., Yang, J., & Yu, G. (2009). Research of an e-mail forensic and

analysis system based on visualization. In IEEE Asia-Pacific Conference on

Computational Intelligence and Industrial Applications (PACIIA), 1, 281-284.

Levinson, A., Stackpole, B., & Johnson, D. (2011). Third party application forensics

on apple mobile devices. In IEEE 44th Hawaii International Conference on

System Sciences, 1-9.

Al Mutawa, N., Baggili, I., & Marrington, A. (2012). Forensic analysis of social

networking applications on mobile devices. Digital Investigation, 9, S24-S33.

Al-Saleh, M.I., & Forihat, Y.A. (2013). Skype forensics in android

devices. International Journal of Computer Applications, 78(7), 38-44.

Gao, F., & Zhang, Y. (2013). Analysis of WeChat on iPhone. In 2nd international

symposium on computer, communication, control and automation, Atlantis Press,

278-281.

Greff, K., Srivastava, R.K., Koutník, J., Steunebrink, B.R., & Schmidhuber, J. (2016).

LSTM: A search space odyssey. IEEE transactions on neural networks and

learning systems, 28(10), 2222-2232.

Federal Rules of Evidence, https://www.law.cornell.edu/rules/fre/rule_401.

A Survey of Forensic Analysis and Information Visualization Approach for Instant Messaging Applications

Article

Full-text available

Jan 2023

Construction of Network Information Security Risk Framework Based on Data Mining and Analysis

Conference Paper

Oct 2022

Hengxing Chen

Digital Forensics Subdomains: The State of the Art and Future Directions

Article

Full-text available

Oct 2021

For reliable and relevant scientific evidence to be admitted in a court of law, it is important to apply digital forensic investigation techniques to corroborate a suspected potential security incident. Mainly, traditional digital forensics techniques have focused on computer desktops and servers. However, recent advances in digital media and platforms have seen an increased need for the application of digital forensic investigation techniques to other subdomains including small and mobile devices, databases, networks, cloud-based platforms, and the Internet of Things (IoT). To assist forensic investigators, conduct investigations within these subdomains, academic researchers have attempted to develop a number of investigative processes. However, many of these processes are domain-specific or describe domain-specific investigative tools. Hence, we hypothesize that the literature is littered with potentially overlapping and contradicting investigative process for conducting investigations within these subdomains. To investigate this hypothesis, a digital forensic model-orientated Systematic Literature Review (SLR) within the above digital forensic subdomains was undertaken. The purpose of the SLR was to identify the different and heterogeneous practices that have emerged within the specific subdomains. A key finding from the SLR is that there is a potential information overload and a high-degree of ambiguity among investigative processes in the above subdomains. The outcome of this study proposes a high-level abstract metamodel called The Digital Forensic Metamodel (DFM), which combines common processes, activities, techniques, and tasks for the above subdomains.

Forensic analysis of cloud-native artifacts

Article

Full-text available

Mar 2016
DIGIT INVEST

Forensic analysis of cloud artifacts is still in its infancy; current approaches overwhelming follow the traditional method of collecting artifacts on a client device. In this work, we introduce the concept of analyzing cloud-native digital artifacts–data objects that maintain the persistent state of web/SaaS applications. Unlike traditional applications, in which the persistent state takes the form of files in the local file system, web apps download the necessary state on the fly and leave no trace in local storage.

Mobile forensics using the harmonised digital forensic investigation process

Article

Full-text available

Nov 2014

Mobile technology is among the fastest developing technologies that have changed the way we live our daily lives. Over the past few years, mobile devices have become the most popular form of communication around the world. However, bundled together with the good and advanced capabilities of the mobile technology, mobile devices can also be used to perform various activities that may be of malicious intent or criminal in nature. This makes mobile devices a valuable source of digital evidence. For this reason, the technological evolution of mobile devices has raised the need to develop standardised investigation process models and procedures within the field of digital forensics. This need further supports the fact that forensic examiners and investigators face challenges when performing data acquisition in a forensically sound manner from mobile devices. This paper, therefore, aims at testing the harmonised digital forensic investigation process through a case study of a mobile forensic investigation. More specifically, an experiment was conducted that aims at testing the performance of the harmonised digital forensic investigation process (HDFIP) as stipulated in the ISO/IEC 27043 draft international standard through the extraction of potential digital evidence from mobile devices.

Android Forensics: Correlation Analysis

Conference Paper

Full-text available

Dec 2014

With over 6 billion mobile phone subscribers, it is inevitable that such devices will be involved in criminal activities. Furthermore, the evolution of smartphones has changed the way people use their mobile phones in their everyday life. That is, a huge variety of services exist in the device that can be exploited for either perpetrating a criminal activity or being the subject of the crime. By conducting an analysis of existing forensic tools and the literature, it became evident that there is a significant lack of advanced tools that enable the correlation among the various events of forensic interest in order to facilitate an investigation and reduce the cognitive load on the analyst side. Motivated by this fact, the paper at hand proposes a novel tool that incorporates strong mechanisms to forensically analyze an Android device, aiming to reduce the workload of the investigator through advanced and intelligent correlation and visualization.

The State of the Art Forensic Techniques in Mobile Cloud Environment: A Survey, Challenges and Current Trend

Article

Full-text available

Apr 2015

Smartphones have become popular in recent days due to the accessibility of a wide range of applications. These sophisticated applications demand more computing resources in a resource constraint smartphone. Cloud computing is the motivating factor for the progress of these applications. The emerging mobile cloud computing introduces a new architecture to offload smartphone and utilize cloud computing technology to solve resource requirements. The popularity of mobile cloud computing is an opportunity for misuse and unlawful activities. Therefore, it is a challenging platform for digital forensic investigations due to the nonavailability of methodologies, tools and techniques. The aim of this work is to analyze the forensic tools and methodologies for crime investigation in a mobile cloud platform as it poses challenges in proving the evidence

LSTM: A search space odyssey

Article

Full-text available

Mar 2015

Several variants of the Long Short-Term Memory (LSTM) architecture for recurrent neural networks have been proposed since its inception in 1995. In recent years, these networks have become the state-of-the-art models for a variety of machine learning problems. This has led to a renewed interest in understanding the role and utility of various computational components of typical LSTM variants. In this paper, we present the first large-scale analysis of eight LSTM variants on three representative tasks: speech recognition, handwriting recognition, and polyphonic music modeling. The hyperparameters of all LSTM variants for each task were optimized separately using random search and their importance was assessed using the powerful fANOVA framework. In total, we summarize the results of 5400 experimental runs (about 15 years of CPU time), which makes our study the largest of its kind on LSTM networks. Our results show that none of the variants can improve upon the standard LSTM architecture significantly, and demonstrate the forget gate and the output activation function to be its most critical components. We further observe that the studied hyperparameters are virtually independent and derive guidelines for their efficient adjustment.

Forensic Analysis of WhatsApp Messenger on Android Smartphones

Article

Full-text available

May 2014
DIGIT INVEST

Cosimo Anglano

We present the forensic analysis of the artifacts left on Android devices by WhatsApp Messenger, the client of the WhatsApp instant messaging system. We provide a complete description of all the artifacts generated by WhatsApp Messenger, we discuss the decoding and the interpretation of each one of them, and we show how they can be correlated together to infer various types of information that cannot be obtained by considering each one of them in isolation. By using the results discussed in this paper, an analyst will be able to reconstruct the list of contacts and the chronology of the messages that have been exchanged by users. Furthermore, thanks to the correlation of multiple artifacts, (s)he will be able to infer information like when a specific contact has been added, to recover deleted contacts and their time of deletion, to determine which messages have been deleted, when these messages have been exchanged, and the users that exchanged them.

Artificial Intelligence, Big Data, and Cloud Computing

Chapter

Mar 2021

Bernd W. Wirtz

By working through this chapter, you will be able to:

Security Issues and Challenges of Mobile Cloud Computing

Article

Dec 2013

Cloud computing is proving itself an emerging technology in IT world which provides a novel business model for organizations to utilize softwares, applications and hardware resources without any upfront investment. Few years later with the broad development in mobile applications and advancements in cloud computing, a new expansion is being expected in the form of mobile cloud computing (MCC). MCC provides a platform where mobile users make use of cloud services on mobile devices. The use of MCC minimizes the performance, compatibility, and lack of resources issues in mobile computing environment. Despite the astonishing advancement achieved by MCC, the users of MCC are still below expectations because of the associated risks in terms of security and privacy. These risks are playing important role by preventing the organizations to adopt MCC environment. Significant amount of research is in progress in order to reduce the security concerns but still a lot work has to be done to produce a security prone MCC environment. This paper presents a comprehensive literature review of MCC and its security issues and challenges.

Android Malware Forensics: Reconstruction of Malicious Events

Conference Paper

Jun 2012

Smart mobile devices have been widely used and the contained sensitive information is endangered by malwares. The malicious events caused by malwares are crucial evidences for digital forensic analysis, and the main task of mobile forensic analysis is to reconstruct these events. However, the reconstruction heavily relies on the code analysis of the malware. The difficulties and challenges include how to quickly identify the suspicious programs, how to defeat the anti-forensics tricks of malicious code, and how to deduce the malicious behaviors according to the code. To address this issue, we propose systematic procedures of analyzing typical malware behaviors on the popular mobile operating system Android. Based on the procedures we discuss the deduction of Android malicious events. We also give a real malware forensic case as a reference.

Forensics investigation challenges in cloud computing environments

Conference Paper

Jun 2012

Cloud computing discusses about sharing any imaginable entity such as process units, storage devices or software. The provided service is utterly economical and expandable. Cloud computing attractive benefits entice huge interest of both business owners and cyber thefts. Consequently, the “computer forensic investigation” step into the play to find evidences against criminals. As a result of the new technology and methods used in cloud computing, the forensic investigation techniques face different types of issues while inspecting the case. The most profound challenges are difficulties to deal with different rulings obliged on variety of data saved in different locations, limited access to obtain evidences from cloud and even the issue of seizing the physical evidence for the sake of integrity validation or evidence presentation. This paper suggests a simple yet very useful solution to conquer the aforementioned issues in forensic investigation of cloud systems. Utilizing TPM in hypervisor, implementing multi-factor authentication and updating the cloud service provider policy to provide persistent storage devices are some of the recommended solutions. Utilizing the proposed solutions, the cloud service will be compatible to the current digital forensic investigation practices; alongside it brings the great advantage of being investigable and consequently the trust of the client.

The Enhanced Forensic Examination and Analysis for Mobile Cloud Platform by Applying Data Mining Methods

Abstract and Figures

Recommended publications

Enhanced Forensic Process for Improving Mobile Cloud Traceability in Cloud-Based Mobile Applications

The State of the Art Forensic Techniques in Mobile Cloud Environment

The State of the Art Forensic Techniques in Mobile Cloud Environment: A Survey, Challenges and Curre...

Feature indexing and search optimization for enhancing the forensic analysis of mobile cloud environ...