
BLADE: Robust malware detection against obfuscation in android

September 2021
Forensic Science International Digital Investigation 38(3):301176

DOI:10.1016/j.fsidi.2021.301176

Authors:

Vikas Sihag

Sardar Patel University of Police, Security and Criminal Justice, Jodhpur

Manu Vardhan

National Institute of Technology Raipur

Pradeep Singh

National Institute of Technology Raipur

Android OS popularity has given significant rise to malicious apps targeting it. Malware use state of the art obfuscation methods to hide their functionality and evade anti-malware engines. We present BLADE, a novel obfuscation resilient system based on Opcode Segments for detection. It makes three contributions: Firstly, a novel Opcode Segment Document results in feature characterization resilient to obfuscation techniques. Secondly, we perform semantics based simplification of dalvik opcodes to enhance the resilience. Thirdly, we evaluate effectiveness of BLADE against different obfuscation techniques such as trivial obfuscation, string encryption, class encryption, reflection and their combinations. Our approach is found effective, accurate and resilient, when tested against benchmark datasets for malware detection, familial classification, malware type detection, obfuscation type detection and obfuscation resilient familial classification. Dataset available on: https://www.kaggle.com/vikassihag/blade-dataset

BLADE: Robust Malware Detection against Obfuscation in Android

Vikas Sihag a,b,∗, Manu Vardhan b, Pradeep Singh b

a Sardar Patel University of Police, Security and Criminal Justice, Jodhpur, India

b National Institute of Technology, Raipur, India

Keywords: Android, Malware detection, Code obfuscation, Familial classification

1. Introduction

Since its release in 2008, Android has become the most popular mobile OS, holding a 72.26% share among 3.8 billion smartphone users worldwide in July 2020 [1]. Android's popularity and its application distribution model open new attack surfaces targeting users' privacy and security [2]. Recently, among the top 5000 Android apps on the Play Store, 655 were found to have zero-day vulnerabilities and 983 to have known vulnerabilities [3]. Mobile attacks by cyber criminals have expanded from backdoors and crypto mining to click farming, ad fraud and fake reviews using malicious applications (aka apps). Malicious activities comprise information leakage, device failure or data corruption with selfish or harmful motives.

∗ Corresponding author

Email addresses: vikas.sihag@policeuniversity.ac.in (Vikas Sihag), mvardhan.cs@nitrr.ac.in (Manu Vardhan), psingh.cs@nitrr.ac.in (Pradeep Singh)

Preprint submitted to Forensic Science International: Digital Investigation, May 12, 2021

Malware authors are adopting state-of-the-art application stealth techniques, such as advanced code obfuscation and protection mechanisms, to evade anti-malware engines [4, 5, 6]. Current malware is enhanced with code obfuscation, encryption, dynamic loading and/or native code execution techniques to prevent app reversal [7, 8].

The process of understanding the functionality and infection behavior of malware is popularly known as malware analysis. It is generally classified into static (code) analysis and dynamic (behavioral) analysis. The static approach analyzes code sequences without executing them, whereas the dynamic approach analyzes run-time execution [9, 10]. Static analysis is lightweight and has higher code coverage than dynamic analysis [7, 11]. Dynamic analysis executes and monitors an application to track its behaviour, understand its features and identify technical indicators that can be used as detection signatures [12, 13, 14]. Malware analysis is generally tasked either with deciding whether an executable sample is malicious (i.e. malware detection) or with identifying the malware family it belongs to (i.e. familial classification). App stealth techniques pose a challenge to efficient malware detection and familial classification [15].

Obfuscation comprises actions that modify an app's code without changing its functionality or semantics [16]. Obfuscation techniques can be classified into trivial and non-trivial. Trivial techniques do not perform code-level changes, whereas non-trivial techniques do. Trivial obfuscation methods such as repackaging are used to attach malicious code to legitimate apps; 86% of malware samples were found to use these methods [17]. Identifying the malicious component in a repackaged app is a challenge for malware analysis. Non-trivial obfuscation methods such as class encryption, string encryption, identifier renaming, code reordering and reflection modify the code semantics, thus preventing analysis and evading detection systems. For instance, Listing 1 shows a code fragment from DroidDream, and Listing 2 shows the corresponding code after identifier renaming. Semantic changes induced by obfuscation methods can easily evade signature-based classification.

    const-string v15, "profile"
    const-string v16, "mount -o remount rw system\nexit\m"
    invoke-static {v15, v16}, Lcom/android/root/Setting;->runRootCommand(Ljava/lang/String;Ljava/lang/String;)Ljava/lang/String;
    move-result-object v10

Listing 1: A bytecode fragment from DroidDream malware.

    const-string v15, "profile"
    const-string v16, "mount -o remount rw system\nexit\m"
    invoke-static {v15, v16}, Lcom/hxbvgH/IWNcZs/jFAbKo;->axDnBL(Ljava/lang/String;Ljava/lang/String;)Ljava/lang/String;
    move-result-object v10

Listing 2: The bytecode after performing identifier renaming on Listing 1.

To address the above challenges we propose BLADE (roBust maLwAre DEtection system), a novel obfuscation-resilient approach based on opcode segments. We first generate .smali files from an input APK (an Android executable), and then extract Dalvik opcode [18] sequences from the .smali files. As Dalvik opcodes represent the behavioral pattern of an application, they are simplified to generate opcode sequences. The opcode sequences are then segmented to represent an APK as an Opcode Segments Document (OSD). Finally, the OSD is used for malware detection and familial classification.
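The intuition behind working on opcodes can be illustrated with Listings 1 and 2. The following is a minimal sketch, not the authors' implementation: the helper name `opcode_sequence` is hypothetical, and it assumes each instruction sits on one line with the opcode as its first token. Once operands are dropped, the original and the renamed fragment yield the same opcode sequence.

```python
def opcode_sequence(smali_lines):
    """Keep only the opcode (first token) of each instruction line."""
    opcodes = []
    for line in smali_lines:
        line = line.strip()
        # Skip blank lines, directives (.method), comments (#) and labels (:L1).
        if not line or line.startswith((".", "#", ":")):
            continue
        opcodes.append(line.split(None, 1)[0])
    return opcodes


# Listing 1: fragment from DroidDream (raw strings keep the backslashes verbatim).
LISTING_1 = [
    r'const-string v15, "profile"',
    r'const-string v16, "mount -o remount rw system\nexit\m"',
    r'invoke-static {v15, v16}, Lcom/android/root/Setting;->runRootCommand('
    r'Ljava/lang/String;Ljava/lang/String;)Ljava/lang/String;',
    r'move-result-object v10',
]

# Listing 2: the same fragment after identifier renaming.
LISTING_2 = [
    r'const-string v15, "profile"',
    r'const-string v16, "mount -o remount rw system\nexit\m"',
    r'invoke-static {v15, v16}, Lcom/hxbvgH/IWNcZs/jFAbKo;->axDnBL('
    r'Ljava/lang/String;Ljava/lang/String;)Ljava/lang/String;',
    r'move-result-object v10',
]

if __name__ == "__main__":
    print(opcode_sequence(LISTING_1))
    print(opcode_sequence(LISTING_1) == opcode_sequence(LISTING_2))
```

Because class, method and package names live only in the operands, renaming them leaves this representation unchanged.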

In short, the main contributions are summarized below:

− Opcode Segment Document: We analyzed Android applications from a different perspective and proposed a novel approach for malware characterization based on the Opcode Segments Document (OSD).

− BLADE: We propose BLADE, an efficient and effective malware detection and familial classification system based on OSD.

− Obfuscation Resilient Evaluation: We evaluated the effectiveness of BLADE against popular obfuscation techniques such as trivial obfuscation, string encryption, reflection, class encryption and their combinations.

− Typically, Android apps contain a single DEX file, but some may comprise multiple DEX files. BLADE is able to handle these complex apps by extracting features from multiple DEX files.

− Scalable Detection: We evaluated and compared BLADE over benchmark datasets. It is effective and accurate for malware detection and familial classification, and it is obfuscation resilient. Overall, it achieves better performance than other state-of-the-art approaches on several aspects.

Paper organization: In Section 2, we describe Dalvik bytecode and obfuscation methods in Android apps as the background required for the proposed work. Section 3 elaborates the working and design principles of BLADE. Section 4 defines research questions and evaluates the performance of BLADE against them. Section 5 contrasts the proposed work with existing state-of-the-art solutions. Related work is discussed in Section 6, followed by the conclusion and future directions in Section 7.

2. Background

In this section we discuss the preliminary background knowledge required for our work. We discuss Dalvik bytecode (Section 2.1) and popular obfuscation techniques (Section 2.2).

2.1. Dalvik Bytecode

Android has a distinct executable machine code format called Dalvik bytecode. Java .class files, along with other .jar library files, are converted into a Dalvik executable classes.dex file. This file, along with compiled resources and shared object (.so) files, is then compressed into an Android PacKage (APK) file. The APK file is downloaded for installation when requested from the Google Play Store. A classes.dex file contains definitions of multiple classes, each comprising multiple methods. While classes.dex is a non-readable binary file, it can be disassembled into smali files, which are an intermediate human-readable format. Smali code generated from Dalvik bytecode comprises a class and its methods in each smali file. Each method contains instructions, and each instruction consists of an opcode and its operand(s). For instance, the instruction move-wide/from16 vBB, vAAAA has move as the base opcode, wide (64-bit data) as the name suffix, from16 (16-bit register reference) as the opcode suffix, vBB as the destination register and vAAAA as the source register. The Dalvik opcode constant list defines 237 opcodes, of which only 217 are used in practice in APKs [19]. Being human readable, Dalvik bytecode is easier to analyze than machine code. Androguard [20], APKTool [21] and Dexdump are popular reverse engineering tools for extracting APK dex code.

2.2. Android Application Obfuscation

In our context, the term obfuscation refers to the transformation of an application executable (APK) without altering its original functionality. Obfuscation of Android applications is a double-edged sword for analysts, as it protects legitimate developers against code cloning as well as malware authors against a range of analysis engines [22]. The following popular obfuscation techniques pose a challenge to malware analysis.

Trivial Obfuscation: It covers obfuscation methods which affect the strings, but not the executable instructions, in bytecode. It comprises renaming files, fields, classes, methods and packages with random or predefined nomenclature. It also includes repackaging of the APK.

Repackaging: In repackaging, an APK is unpacked, re-packed and signed with a new key to generate a repackaged app. Popular applications are injected with malicious code and repackaged to be hosted on marketplaces, making it difficult for users to verify their authenticity.

Table 1: Comparative analysis of Android application obfuscation tools

Tool    Repackaging    Flow Obfuscation    String Encryption    Class Encryption    Resource Encryption    Reflection
Allatori [26] X X
APK Protect [27] X X X
Arxan X X X X
DexGuard [28] X X X X X X
DashO [29] X X X X
DexProtector [30] X X X X X X
Ijiami X X X X
Mobile Protector [31] X X X X X
ProGuard [32] X X X X X X
Promon Shield [33] X X X X X X
Stringer [34] X X X X

Control Flow Obfuscation: It is the process of rearranging instructions in a method to evade control flow analysis. This includes the instruction patterns used by reverse-engineering tools to decompile the source code.

String Encryption: Strings often reveal malware-identifiable information such as names or URLs. String encryption can obstruct searches for hard-coded strings by rendering the strings unreadable [22, 23]. The original string is stored in an encrypted form and requires an additional decryption function.

Class Encryption: It is an advanced code obfuscation technique which encrypts a class. The encrypted class is decrypted and loaded at runtime by a separate function. The computational overhead of class encryption is high, as is its resilience against static analysis [24].

Reflection: Reflection is a popular feature in Java that allows object interaction at runtime. It is popular among developers for obfuscating sensitive library and API calls [25]. It transfers execution flow to the desired code segment implicitly.

Resource Encryption: Resources and assets are used by malware for payload or code hiding. This technique encrypts the application resources, which are decrypted during execution. For example, Rootnik malware encrypted its resource file to a secData0.jar file [5].

A comprehensive analysis of Android application obfuscation tools with reference to their features and techniques is given in Table 1. The tools listed are popular among developers for application hardening [5].

Figure 1: Architecture of the proposed approach.

3. Design of BLADE

Overview

We convert the problem of malware detection and familial classification into a document classification problem. For a text document, characters are the basic building blocks; ordered sets of characters form words, sentences and paragraphs. We develop an Android malware detection system, BLADE, which represents an application as a document with opcode characters as its building blocks. BLADE is resilient to obfuscation and achieves high accuracy on malware detection and familial classification. The proposed approach includes two procedures. The first creates the detection and classification model. The second uses the model to predict, for a given application, malware detection, familial classification and obfuscation detection. The overall architecture is illustrated in Figure 1.

The malware detection training set comprises malware apps and benign apps. The training set for familial classification includes malware samples from different family subsets. The training set for obfuscation detection comprises malware samples grouped by obfuscation type. For the obfuscation detection training set, we have considered trivial obfuscation (T), string encryption (S), reflection (R), class encryption (C), trivial + string encryption (TS), trivial + string encryption + reflection (TSR) and trivial + string encryption + reflection + class encryption (TSRC).

As shown in the architectural diagram, the APK sample to be predicted is preprocessed to extract its DEX bytecode file, which is then used to extract .smali files. Each smali file specifies methods and fields. Intermediate opcode sequences are generated from the .smali files. The opcode sequences are simplified and segmented to give opcode segments. An application is thus represented as an Opcode Segment Document (OSD). Each OSD is a collection of opcode segments, which are then reduced and selected for detection. Furthermore, this model is used for obfuscation detection and familial classification.

The obfuscation techniques mentioned in Section 2.2 are a challenge to malware detection. Our proposed solution mitigates some of these threats.

Opcode simpliﬁcation and OSD generation

The proposed approach represents each malware sample by an Opcode Segment Document (OSD) generated from its DEX code. As outlined in Section 2.1, DEX code represents instruction-level operation codes. We decompile and extract .smali files from DEX code using APKTool [21]. We analyze the .smali files and group their instructions based on usage. Instructions performing the same operation but on different register indices are considered similar. For example, the Dalvik instructions "move vA, vB" and "move/from16 vAA, vBBBB" both move contents from one register to another; they differ only in the number of bits used for the registers. All instructions are assigned, based on their semantics, to 19 symbolic groups. Table 2 lists the symbols assigned to 224 Dalvik instructions. For example, symbol A represents all arithmetic instructions such as add-int, add-int/2addr or sub-int. The nop instruction, which performs no operation, is not assigned any symbol and is skipped when encountered. This grouping of similar opcodes (Dalvik instructions) based on semantics is defined as Opcode Simplification. Opcode simplification results in an application represented as a collection of opcode sequences.

Table 2: Symbolic representation of Dalvik instruction set

Semantics      Symbol   Number   Opcode prefixes
Arithmetic     A        50       …
Casting        H        1        check-cast
Comparison     C        5        cmp-long | cmpg-double | cmpg-float | cmpl-double | cmpl-float
Definition     D        11       const | const-class | const-string | const-wide
Inline         U        1        execute-inline
…              V        15       … invoke-super-quick | invoke-virtual | invoke-virtual-quick
Instance       F        6        fill-array-data | filled-new-array | instance-of | new-array | new-instance
Jump           J        3        goto
…              L        24       … not-long | or-int | or-long | xor-int | xor-long
Monitor        E        2        monitor-enter | monitor-exit
…              M        13       … move-result-wide | move-wide
Read           G        24       …
Return         R        4        return | return-object | return-void | return-wide
Switch         S        2        packed-switch | sparse-switch
Throw          O        1        throw
Type Change    T        15       … int-to-long | int-to-short | long-to-double | long-to-float | long-to-int
Write          P        24       …
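Opcode simplification can be sketched as a lookup from an instruction's base opcode to its group symbol. The sketch below is illustrative only: the names `SYMBOLS`, `to_symbol` and `simplify` are hypothetical, the table is deliberately partial (only opcodes legible in Table 2 or named in the text), and stripping the text after the first "/" to obtain the prefix is an assumption about how prefixes are matched.

```python
# Partial symbol table. Assumption: only entries legible in Table 2 or named
# in the surrounding text are listed; a full table would cover every group.
SYMBOLS = {
    "add-int": "A", "sub-int": "A",                      # Arithmetic (from the text)
    "check-cast": "H",                                   # Casting
    "cmp-long": "C", "cmpg-double": "C", "cmpg-float": "C",
    "cmpl-double": "C", "cmpl-float": "C",               # Comparison
    "const": "D", "const-class": "D",
    "const-string": "D", "const-wide": "D",              # Definition
    "execute-inline": "U",                               # Inline
    "invoke-virtual": "V",
    "fill-array-data": "F", "filled-new-array": "F",
    "instance-of": "F", "new-array": "F", "new-instance": "F",
    "goto": "J",                                         # Jump
    "or-int": "L", "xor-int": "L", "not-long": "L",
    "monitor-enter": "E", "monitor-exit": "E",           # Monitor
    "move-wide": "M", "move-result-wide": "M",
    "move": "M",  # assumption: plain move shares the symbol of move-wide
    "return": "R", "return-object": "R",
    "return-void": "R", "return-wide": "R",              # Return
    "packed-switch": "S", "sparse-switch": "S",          # Switch
    "throw": "O",                                        # Throw
    "int-to-long": "T", "long-to-int": "T",              # Type Change
}


def to_symbol(instruction):
    """Map one smali instruction to its group symbol ('' for nop)."""
    opcode = instruction.split(None, 1)[0]   # drop the operands
    base = opcode.split("/", 1)[0]           # drop suffixes such as /from16, /2addr
    if base == "nop":
        return ""                            # nop carries no symbol and is skipped
    return SYMBOLS[base]                     # KeyError exposes gaps in the partial table


def simplify(instructions):
    """Turn a list of instructions into a string of group symbols."""
    return "".join(to_symbol(i) for i in instructions)


if __name__ == "__main__":
    print(simplify(['const-string v15, "profile"', "nop",
                    "add-int/2addr v0, v1", "goto :L1", "return-void"]))
```

Raising on an unknown opcode is a design choice for this sketch: it keeps an incomplete table from silently producing shortened sequences.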

Furthermore, an opcode sequence is divided into opcode segments. An opcode segment is a functional block of successive opcode instructions. A new segment is created by breaking the opcode sequence at locations where control flow diverges. For example, the opcode sequence DFFPDJGDGVM is divided into DFFPD and GDGVM at the pivot opcode J, which corresponds to a jump. nop instructions are skipped during symbol mapping as they add no functional value to the code. OSD generation is described in Algorithm 1.

Algorithm 1: OSD Generation

Input: sample.APK
Output: Opcode Segment Document of the sample

Initialize OSD file
Extract DEX files from sample.APK
foreach DEX file do
    Extract .smali files
end
foreach .smali file do
    Initialize OpcodeSegment
    Extract instructions
    Ignore instruction operands
    foreach instruction do
        if instruction is nop then
            continue
        end
        if instruction is for control diversion then
            Append OpcodeSegment to OSD
            Create new OpcodeSegment
            continue
        else
            Map instruction to Symbol using Symbol Table
            Append the Symbol to OpcodeSegment
        end
    end
    Append OpcodeSegment to OSD
end
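The segmentation step of OSD generation can be sketched on an already simplified symbol string. This is an illustrative sketch under stated assumptions: the function name `split_segments` is hypothetical, the pivot set defaults to the jump symbol J only (the text names J as a pivot; other control-diverting symbols could be passed in), and empty segments are dropped.

```python
def split_segments(symbols, pivots=frozenset("J")):
    """Break a symbol sequence into opcode segments at pivot symbols.

    The pivot itself is not kept, matching the example in the text where
    DFFPDJGDGVM becomes DFFPD and GDGVM.
    """
    segments, current = [], []
    for symbol in symbols:
        if symbol in pivots:
            if current:                      # assumption: drop empty segments
                segments.append("".join(current))
            current = []
        else:
            current.append(symbol)
    if current:                              # flush the trailing segment
        segments.append("".join(current))
    return segments


def build_osd(symbol_sequences, pivots=frozenset("J")):
    """Collect the segments of every sequence into one space-separated document."""
    words = []
    for sequence in symbol_sequences:
        words.extend(split_segments(sequence, pivots))
    return " ".join(words)


if __name__ == "__main__":
    print(split_segments("DFFPDJGDGVM"))
    print(build_osd(["DFFPDJGDGVM", "DVMR"]))
```

Joining the segments with spaces makes each segment behave like a word, which is what lets standard document classification tooling consume an OSD.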

Feature Extraction

To make an OSD classifiable, we perform feature extraction, that is, we convert the document into a set of features. Each opcode segment word in the OSD is treated as a feature, with its frequency as the feature value. We generate a feature vector representation of opcode segment words, quantified by the number of occurrences of each in an OSD.
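The frequency-based feature vector described above can be sketched with a simple word count. The function name `feature_vector` and the toy OSD are hypothetical; the vocabulary is assumed to be fixed in advance (for example, collected from the training set) so that every sample maps to a vector of the same length.

```python
from collections import Counter


def feature_vector(osd_segments, vocabulary):
    """Count how often each vocabulary segment occurs in one OSD."""
    counts = Counter(osd_segments)
    # Segments absent from this OSD get a count of 0.
    return [counts[word] for word in vocabulary]


if __name__ == "__main__":
    osd = ["DFFPD", "GDGVM", "DFFPD"]              # toy OSD, not real data
    vocabulary = ["DFFPD", "DVMR", "GDGVM"]        # assumed fixed feature order
    print(feature_vector(osd, vocabulary))
```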

Attribute Selection

The feature extraction discussed above outputs features, many of which are irrelevant. We use attribute selection to choose significant features from the extracted ones. During attribute selection we evaluate the worth of each feature by calculating its information gain. Information gain captures the reduction in entropy due to a classification, thus capturing the effectiveness of a feature with reference to the class. Formally, let $F$ be a set of samples to be classified into $M$ classes and let $F_m$ denote the $m$-th subclass. Then, the entropy of $F$ is:

$$E(F) = -\sum_{m \in M} \frac{|F_m|}{|F|} \times \log_2 \frac{|F_m|}{|F|}$$

For a feature $f$ with $V(f)$ as the set of its possible values, let $F_v$ denote the sample subset with feature value $v$ for $f$ [35]. Thus the information gain of the feature $f$ can be calculated as:

$$IG(F, f) = E(F) - \sum_{v \in V(f)} \frac{|F_v|}{|F|} \times E(F_v)$$

Features are then ranked by their correlation to the class, as measured by the information gain value.
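As a worked illustration of the two formulas, the sketch below computes entropy and information gain for a single feature over a small labelled sample set. The function names and the toy labels are hypothetical, and feature values are treated as discrete, as in the definition of V(f).

```python
from collections import Counter, defaultdict
from math import log2


def entropy(labels):
    """E(F): entropy of the class distribution over a non-empty sample set."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """IG(F, f): entropy of F minus the weighted entropy of each subset F_v."""
    subsets = defaultdict(list)
    for value, label in zip(feature_values, labels):
        subsets[value].append(label)          # F_v: samples with feature value v
    total = len(labels)
    remainder = sum(len(s) / total * entropy(s) for s in subsets.values())
    return entropy(labels) - remainder


if __name__ == "__main__":
    labels = ["malware", "malware", "benign", "benign"]   # toy labels
    print(information_gain([1, 1, 0, 0], labels))  # feature separates the classes
    print(information_gain([1, 0, 1, 0], labels))  # feature carries no information
```

A feature that splits the classes perfectly recovers the full entropy of the set, while a feature distributed identically across classes scores zero, which is why ranking by this value filters out uninformative segments.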

Classiﬁcation Model

We implement the classification and detection phase of BLADE using machine learning approaches. The representation of sensitive behaviors enables us to detect and classify malware samples effectively using learning techniques. We select J48, k-NN, Random Forest (RF) and Sequential Minimal Optimization (SMO) for supervised learning. Our system is trained on labeled data and then evaluated on testing data.

Table 3: Description of different datasets

Dataset                 # benign   # malwares   # families   Year of release
AndroAutopsy            109193     9990         30           2015
AndroTracker            51179      4554         20           2015
Drebin                  -          5560         179          2014
PRAGuard (Malgenome)    -          8750         23           2015
PRAGuard (Contagio)     -          1652         -            2015

4. Performance Evaluation

In this section, we first introduce the datasets and evaluation parameters. We then evaluate our proposed approach against the following research questions.

RQ1: Can BLADE detect malware samples with high accuracy? (Malware detection)

RQ2: Can BLADE effectively classify malware samples into their respective families? (Familial Classification)

RQ3: Can BLADE classify malware samples into their classes with high TPR and low FPR? (Malware Class/Type Detection)

RQ4: Can BLADE effectively detect the obfuscation type used by a malware? (Obfuscation Detection)

RQ5: Can BLADE be resilient to obfuscation methods while classifying malware samples? (Familial Classification)

4.1. Datasets and Evaluation Metrics

In order to answer the above research questions we evaluate BLADE against different benchmark datasets. We selected four Android application datasets, namely AndroAutopsy [36], AndroTracker [37], Drebin [38] and Android PRAGuard [23]. Table 3 describes the datasets used.

AndroAutopsy contains 109193 benign and 9990 malware samples classified into 30 families [36]. AndroTracker contains 51179 benign and 4554 malware samples classified into 20 families [37]. Malware samples in AndroTracker fall into four categories: Adware, Downloader, Riskware and Trojan. Drebin, in contrast, contains only malicious samples (5560) in 179 families [38]. These three datasets are used to answer the RQs pertaining to malware detection, familial classification and malware class detection.

To evaluate the obfuscation resilience of BLADE, we selected the Android PRAGuard dataset, which is a collection of obfuscated malware samples. It contains 10479 obfuscated malware samples, generated by applying different obfuscation methods to Malgenome [17] and Contagio MiniDump [39]. It employs trivial obfuscation, string encryption, reflection and class encryption, as well as their combinations. Obfuscated malware samples in Android PRAGuard generated from Malgenome are classified into 23 family labels. We use Android PRAGuard to answer the RQs related to obfuscation resilience and classification of obfuscated malware.

Table 4 lists the evaluation parameters employed to evaluate BLADE.

Table 4: Malware detection and classification evaluation metrics.

Term             Abbreviation   Definition
True Positive    TP             No. of samples correctly detected as malware or correctly classified into family f.
True Negative    TN             No. of samples correctly detected as benign or correctly not classified into family f.
False Positive   FP             No. of samples incorrectly detected as malware or incorrectly classified into family f.
False Negative   FN             No. of samples incorrectly detected as benign or incorrectly not classified into family f.
Precision        p              TP / (TP + FP)
Recall           r              TP / (TP + FN)
F-measure        F1             2rp / (r + p)
ROC Area         AUC            Area under ROC curve
Accuracy         Acc            Percentage of malwares correctly detected or classified
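The count-based metrics of Table 4 can be computed directly from the four confusion counts, as in the following sketch. The function names and the example counts are hypothetical. The false positive rate reported in the result tables is not defined in Table 4; the sketch assumes the standard definition FP / (FP + TN), and TPR is the same quantity as recall.

```python
def precision(tp, fp):
    """p = TP / (TP + FP)"""
    return tp / (tp + fp)


def recall(tp, fn):
    """r = TP / (TP + FN); also the true positive rate (TPR)."""
    return tp / (tp + fn)


def f1_measure(p, r):
    """F1 = 2rp / (r + p)"""
    return 2 * r * p / (r + p)


def false_positive_rate(fp, tn):
    """FPR = FP / (FP + TN) (standard definition, assumed here)."""
    return fp / (fp + tn)


if __name__ == "__main__":
    # Hypothetical counts for one class, for illustration only.
    tp, fp, fn, tn = 90, 10, 30, 870
    p, r = precision(tp, fp), recall(tp, fn)
    print(p, r, f1_measure(p, r), false_positive_rate(fp, tn))
```

For multi-class tasks such as familial classification, these values would be computed per family (one-versus-rest) and then averaged, weighted by family size.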

4.2. Methods for Performance Comparison

We selected four machine learning algorithms as classifiers for our approach, namely: J48 decision tree (number of folds = 3; confidence factor = 0.25), k-nearest neighbors (k = 1), Random Forest (number of trees = 100) and SMO (complexity parameter = 1; tolerance parameter = 0.001). We do not discard any features in the experiments. We use the above algorithms for training and testing, with 10-fold cross validation for testing.

Table 5: Results: Malware detection by BLADE on AndroAutopsy and AndroTracker datasets

Dataset        Method   TPR     FPR     AUC     Acc (%)
AndroAutopsy   J48      0.972   0.030   0.973   97.21
AndroAutopsy   k-NN     0.978   0.025   0.985   97.75
AndroAutopsy   RF       0.982   0.023   0.997   98.18
AndroAutopsy   SMO      0.974   0.027   0.973   97.37
AndroTracker   J48      0.984   0.016   0.986   98.39
AndroTracker   k-NN     0.985   0.015   0.993   98.54
AndroTracker   RF       0.988   0.016   0.999   98.78
AndroTracker   SMO      0.977   0.022   0.977   97.70

4.3. RQ1: Can BLADE detect malware samples with high accuracy?

The malware detection problem deals with identifying malicious samples amongst benign ones. We considered the AndroAutopsy (benign = 109193, malware = 9990) and AndroTracker (benign = 51179, malware = 4554) datasets to evaluate the malware detection performance of BLADE equipped with four different classifiers. Table 5 shows the results of BLADE in terms of TPR, FPR, AUC and Acc. The following conclusions are drawn from it:

• All classifiers perform satisfactorily on both datasets, with accuracy greater than 97%.

• Random Forest outperforms the other classifiers on almost all parameters. k-NN (FPR = 0.015) slightly outperforms Random Forest (FPR = 0.016) in terms of false positive rate when evaluated on AndroTracker.

⇒ RQ1 Answer: BLADE can effectively detect malware samples with high accuracy.

4.4. RQ2: Can BLADE effectively classify malware samples into their respective families?

The problem of classifying malicious samples into their respective malware families is popularly known as familial classification. For the performance evaluation of BLADE we considered three benchmark datasets: AndroAutopsy, AndroTracker and Drebin. Malware samples in the AndroAutopsy (9990 samples) and AndroTracker (4554 samples) datasets are categorized into 30 and 20 families respectively. We selected the top 20 families from the Drebin dataset for evaluation. All four classifiers are tested against the above three datasets for familial classification. Table 6 shows the results of BLADE in terms of TPR, FPR, AUC and Acc. The following conclusions are drawn from it:

• All classifiers perform satisfactorily on AndroAutopsy, AndroTracker and Drebin, with accuracy greater than 93% and AUC of at least 0.976.

• The SMO classifier is more effective than J48, k-NN and RF in terms of TPR, FPR and accuracy.

• Random Forest performs better in terms of AUC. The weighted average AUC of Random Forest on AndroTracker is 1.

Table 7 gives a detailed familial classification performance analysis of BLADE with SMO when applied to the top 20 families in Drebin. This dataset comprises 4664 malware samples categorized into 20 families. Since the family datasets are imbalanced, the F1 measure is the preferred choice for comparison. BLADE with the SMO classifier is effective, with a weighted average F1 measure of 0.985, accuracy of 98.47% and FPR of 0.002. However, the F1 measures of the LinuxLotoor and Glodream families are only between 0.88 and 0.90. This behavior is due to fewer samples in a family and inter-family similarity.

⇒ RQ2 Answer: BLADE can effectively classify malicious samples into their families with high accuracy and F-measure.

Table 6: Results: Familial classification by BLADE on AndroAutopsy, AndroTracker and Drebin datasets

Dataset        Method   TPR     FPR     AUC     Acc (%)
AndroAutopsy   J48      0.936   0.005   0.976   93.62
AndroAutopsy   k-NN     0.932   0.006   0.985   93.19
AndroAutopsy   RF       0.944   0.006   0.996   94.35
AndroAutopsy   SMO      0.950   0.004   0.993   94.97
AndroTracker   J48      0.980   0.004   0.994   97.96
AndroTracker   k-NN     0.983   0.002   0.998   98.29
AndroTracker   RF       0.984   0.003   1.000   98.44
AndroTracker   SMO      0.986   0.002   0.998   98.59
Drebin         J48      0.975   0.003   0.989   97.49
Drebin         k-NN     0.963   0.004   0.989   96.33
Drebin         RF       0.980   0.002   0.999   98.01
Drebin         SMO      0.985   0.002   0.995   98.47

Table 7: Familial classification performance of BLADE with SMO for Drebin dataset (top 20 families)

Family          #     TPR     FPR     p       r       F1      AUC
Adrd            91    0.989   0.000   0.989   0.989   0.989   0.998
BaseBridge      330   0.976   0.000   0.997   0.976   0.986   0.992
DroidDream      81    0.951   0.000   0.987   0.951   0.969   0.981
DroidKungFu     667   0.991   0.004   0.975   0.991   0.983   0.994
LinuxLotoor     70    0.855   0.001   0.922   0.855   0.887   0.959
FakeDoc         132   0.992   0.000   1.000   0.992   0.996   0.998
FakeInstaller   925   0.987   0.002   0.990   0.987   0.989   0.996
FakeRun         61    1.000   0.000   0.984   1.000   0.992   1.000
Gappusin        58    1.000   0.001   0.951   1.000   0.975   1.000
Geinimi         92    0.967   0.000   1.000   0.967   0.983   0.995
GinMaster       339   0.991   0.000   0.994   0.991   0.993   1.000
Glodream        69    0.826   0.000   0.983   0.826   0.898   0.960
Iconosys        152   1.000   0.000   1.000   1.000   1.000   1.000
Imlog           43    0.953   0.000   1.000   0.953   0.976   1.000
Kmin            147   0.993   0.000   0.993   0.993   0.993   1.000
MobileTx        69    1.000   0.000   1.000   1.000   1.000   1.000
Opfake          613   0.997   0.006   0.961   0.997   0.978   0.997
Plankton        625   0.998   0.001   0.994   0.998   0.996   0.999
SendPay         59    0.983   0.000   1.000   0.983   0.991   0.986
SMSreg          41    0.902   0.000   1.000   0.902   0.949   0.971
Weighted Avg.         0.985   0.002   0.985   0.985   0.985   0.995

4.5. RQ3: Can BLADE classify malware samples into their classes with high TPR and low FPR?

Malware samples are categorized, based on their behavior, into types or classes such as Adware and Trojan. We test the effectiveness of BLADE in detecting malware classes against AndroAutopsy, which categorizes its malware samples into five major classes, namely Adware, Downloader, Riskware, Rooter and Trojan. Table 8 illustrates the efficacy of BLADE while categorizing malicious samples into behavior-based classes. The following conclusions are drawn from it.

• All classifiers perform satisfactorily, with accuracy of more than 96.5%.

• The SMO classifier is more effective in correctly classifying the samples, with a better hit rate and a lower fall-out rate.

• The Random Forest classifier is more capable of distinguishing between the classes, with an AUC of 0.997.

⇒ RQ3 Answer: BLADE can effectively distinguish between malicious samples from different classes.

Table 8: Results: Malware class detection by BLADE on AndroAutopsy dataset

Method   TPR     FPR     AUC     Acc (%)
J48      0.965   0.028   0.974   96.54
k-NN     0.967   0.029   0.988   96.70
RF       0.967   0.041   0.997   96.69
SMO      0.975   0.022   0.980   97.53

4.6. RQ4: Can BLADE eﬀectively detect obfuscation type used by a malware?

As discussed in Section 2.2, malware authors enhance their applications with obfuscation techniques to evade detection. We test the efficacy of BLADE on obfuscated samples. In this subsection we ask whether our approach can differentiate between malware samples obfuscated with different methods. We chose the Android PRAGuard dataset [23] for this purpose. Android PRAGuard comprises malware samples from the Malgenome and Contagio datasets obfuscated with multiple methods: trivial obfuscation, string encryption, reflection, class encryption and their combinations. We created sub-datasets from Android PRAGuard for a detailed analysis. The PRAGuard Malgenome (T, S, R & C) and PRAGuard Contagio (T, S, R & C) datasets comprise samples obfuscated by exactly one of Trivial, String encryption, Reflection or Class encryption. The PRAGuard Malgenome (T, S, R, C, TS, TSR & TSRC) and PRAGuard Contagio (T, S, R, C, TS, TSR & TSRC) datasets additionally include samples enhanced with multiple methods. The following conclusions are drawn from the results in Table 9.

• The J48, Random Forest and SMO classifiers are effective in obfuscation type detection. The k-NN classifier is less effective than the others.

• BLADE with the J48 classifier effectively distinguishes between samples enhanced using a single obfuscation method, with accuracy of 99.44% (PRAGuard Malgenome) and 98.83% (PRAGuard Contagio).

• BLADE is more effective on PRAGuard Malgenome (T, S, R & C), with a best accuracy of 99.44%, than on PRAGuard Malgenome (T, S, R, C, TS, TSR & TSRC), with a best accuracy of 93.53%. It is likewise more effective on PRAGuard Contagio (T, S, R & C), with a best accuracy of 98.83%, than on PRAGuard Contagio (T, S, R, C, TS, TSR & TSRC), with a best accuracy of 92.27%. Thus BLADE performs better on samples with a single obfuscation than on samples with combined obfuscations.

Table 9: Results: Obfuscation type detection on PRAGuard dataset

| Dataset | Method | TPR | FPR | AUC | Acc (%) |
|---|---|---|---|---|---|
| PRAGuard Malgenome (T, S, R & C) | J48 | 0.994 | 0.002 | 0.999 | 99.44 |
| PRAGuard Malgenome (T, S, R & C) | k-NN | 0.922 | 0.026 | 0.979 | 92.24 |
| PRAGuard Malgenome (T, S, R & C) | RF | 0.991 | 0.003 | 1 | 99.10 |
| PRAGuard Malgenome (T, S, R & C) | SMO | 0.992 | 0.003 | 0.995 | 99.18 |
| PRAGuard Contagio (T, S, R & C) | J48 | 0.988 | 0.004 | 0.996 | 98.83 |
| PRAGuard Contagio (T, S, R & C) | k-NN | 0.863 | 0.046 | 0.965 | 86.33 |
| PRAGuard Contagio (T, S, R & C) | RF | 0.978 | 0.007 | 0.998 | 97.78 |
| PRAGuard Contagio (T, S, R & C) | SMO | 0.981 | 0.006 | 0.991 | 98.09 |
| PRAGuard Malgenome (T, S, R, C, TS, TSR & TSRC) | J48 | 0.935 | 0.011 | 0.980 | 93.53 |
| PRAGuard Malgenome (T, S, R, C, TS, TSR & TSRC) | k-NN | 0.852 | 0.025 | 0.955 | 85.19 |
| PRAGuard Malgenome (T, S, R, C, TS, TSR & TSRC) | RF | 0.916 | 0.014 | 0.993 | 91.63 |
| PRAGuard Malgenome (T, S, R, C, TS, TSR & TSRC) | SMO | 0.920 | 0.013 | 0.983 | 92.03 |
| PRAGuard Contagio (T, S, R, C, TS, TSR & TSRC) | J48 | 0.921 | 0.013 | 0.978 | 92.09 |
| PRAGuard Contagio (T, S, R, C, TS, TSR & TSRC) | k-NN | 0.857 | 0.024 | 0.957 | 85.68 |
| PRAGuard Contagio (T, S, R, C, TS, TSR & TSRC) | RF | 0.917 | 0.014 | 0.990 | 91.66 |
| PRAGuard Contagio (T, S, R, C, TS, TSR & TSRC) | SMO | 0.923 | 0.013 | 0.979 | 92.27 |

[T: Trivial; S: String Encryption; R: Reflection; C: Class Encryption; TS: Trivial and String Encryption; TSR: Trivial, String Encryption and Reflection; TSRC: Trivial, String Encryption, Reflection and Class Encryption]

⇒ RQ4 Answer: BLADE can effectively differentiate the type of obfuscation used by a malicious sample. It also performs well against samples enhanced with multiple obfuscation techniques.
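The sub-dataset construction used for this research question amounts to filtering samples by their obfuscation label. The following is a minimal sketch, assuming each sample is represented as a (sample id, obfuscation label) pair; the ids, the `make_subset` helper and the tuple layout are hypothetical and only illustrate the selection step, not the actual tooling used to build the datasets.

```python
from collections import Counter
from typing import Iterable, List, Tuple

# Obfuscation labels as abbreviated in Table 9.
SINGLE = ("T", "S", "R", "C")
ALL = SINGLE + ("TS", "TSR", "TSRC")

Sample = Tuple[str, str]  # (sample id, obfuscation label)


def make_subset(samples: Iterable[Sample], labels: Tuple[str, ...]) -> List[Sample]:
    """Keep only the samples whose obfuscation label is in `labels`."""
    wanted = set(labels)
    return [(sid, lab) for sid, lab in samples if lab in wanted]


# Hypothetical samples (illustrative only).
samples = [
    ("a1", "T"), ("a2", "S"), ("a3", "R"), ("a4", "C"),
    ("a5", "TS"), ("a6", "TSR"), ("a7", "TSRC"), ("a8", "T"),
]

single_set = make_subset(samples, SINGLE)  # (T, S, R & C) style sub-dataset
full_set = make_subset(samples, ALL)       # (T, S, R, C, TS, TSR & TSRC) style
class_counts = Counter(lab for _, lab in full_set)
print(len(single_set), len(full_set), dict(class_counts))
```

The obfuscation label then serves as the class to be predicted, so the first sub-dataset poses a four-class problem and the second a seven-class problem.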


4.7. RQ5: Can BLADE be resilient to obfuscation methods while classifying malware samples?

To evaluate the resilience of BLADE against obfuscation methods, we perform familial classification of obfuscated samples from the PRAGuard dataset. We created seven subsets from Android PRAGuard (Malgenome) on the basis of obfuscation method. We then measure how well our approach can identify families within each sub-dataset (T, S, R, C, TS, TSR & TSRC). Each sub-dataset comprises 1250 samples categorized into 23 families. Table 10 shows the accuracy of familial classification on these sub-datasets. The following conclusions are drawn from it.

• BLADE is resilient to Trivial obfuscation, String encryption, Reflection and their combinations.

• BLADE is less resilient against Class encryption and its combinations than against the other obfuscation methods. It nevertheless remains effective at classifying Class-encrypted samples, with up to 92.77% accuracy (J48).

• The SMO classifier performs better than the other classifiers in most cases.

Table 10: Results: Familial classification accuracy (%) of obfuscated malware samples from PRAGuard Malgenome dataset.

| Method | T | S | R | C | TS | TSR | TSRC |
|---|---|---|---|---|---|---|---|
| J48 | 98.60 | 97.86 | 98.77 | 92.77 | 97.87 | 98.53 | 86.65 |
| k-NN | 97.29 | 96.72 | 97.70 | 83.74 | 96.97 | 97.05 | 90.58 |
| RF | 98.69 | 98.44 | 98.61 | 85.97 | 98.37 | 98.20 | 91.32 |
| SMO | 99.02 | 99.02 | 99.18 | 91.87 | 99.26 | 98.69 | 92.47 |

[T: Trivial; S: String Encryption; R: Reflection; C: Class Encryption; TS: Trivial and String Encryption; TSR: Trivial, String Encryption and Reflection; TSRC: Trivial, String Encryption, Reflection and Class Encryption]

⇒ RQ5 Answer: BLADE is resilient to obfuscation methods, classifying malware samples with high accuracy.
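The per-column comparison behind these conclusions can be reproduced directly from the accuracies in Table 10. The sketch below is illustrative only: the numbers are copied from the table, while the dictionary layout and variable names are assumptions of this sketch.

```python
# Familial classification accuracy (%) copied from Table 10.
COLUMNS = ["T", "S", "R", "C", "TS", "TSR", "TSRC"]
TABLE_10 = {
    "J48":  [98.60, 97.86, 98.77, 92.77, 97.87, 98.53, 86.65],
    "k-NN": [97.29, 96.72, 97.70, 83.74, 96.97, 97.05, 90.58],
    "RF":   [98.69, 98.44, 98.61, 85.97, 98.37, 98.20, 91.32],
    "SMO":  [99.02, 99.02, 99.18, 91.87, 99.26, 98.69, 92.47],
}

# accuracy[method][obfuscation] -> accuracy in percent
accuracy = {m: dict(zip(COLUMNS, row)) for m, row in TABLE_10.items()}

# Best classifier for each obfuscation sub-dataset.
best = {c: max(accuracy, key=lambda m: accuracy[m][c]) for c in COLUMNS}
smo_wins = sum(1 for m in best.values() if m == "SMO")

# Accuracy lost when moving from Trivial to Class encryption, per classifier.
drop_t_to_c = {m: round(accuracy[m]["T"] - accuracy[m]["C"], 2) for m in accuracy}

print(best)
print(f"SMO is best on {smo_wins} of {len(COLUMNS)} sub-datasets")
print(drop_t_to_c)
```

The drop from T to C gives a simple per-classifier view of how much Class encryption degrades familial classification relative to Trivial obfuscation.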

5. Discussion

In this section, we compare our proposed system against state-of-the-art Android malware detection systems. Table 11 compares the performance of the proposed work with DANDroid [40] across various obfuscation methods and their combination. DANDroid uses the DexProtector tool to obfuscate the Drebin dataset, whereas the results of BLADE are based on the Malgenome dataset obfuscated using the PRAGuard tool [30, 23]. DANDroid uses a neural-network-based Discriminative Adversarial Network for detection. Both approaches perform well against obfuscation methods other than class encryption, where accuracy dips slightly.

The efficiency and performance of the proposed solution are compared with previous studies in Table 12, which lists the features used for malware detection or classification together with the dataset(s) and technique(s) employed. A few works, such as Millar et al. [40] and Garcia et al. [45], evaluate their approach on both non-obfuscated and obfuscated samples.

Table 11: Classification accuracy (%) comparison of DANDroid and BLADE (proposed work).

| Obfuscation | DANDroid [40] | BLADE |
|---|---|---|
| Trivial | - | 99.02 |
| String Encryption | 98.8 | 99.02 |
| Reflection | 99 | 99.18 |
| Class Encryption | 95.1 | 92.77 |
| Resource Encryption | 98.7 | - |
| All obfuscations applied | 95.3 | 92.47 |

Table 12: Comparison of BLADE with the existing state of the art solutions. [OD: Performance over obfuscated dataset]

| Paper | Year | Features | Techniques | Dataset | Acc (%) |
|---|---|---|---|---|---|
| Arp et al. [38] | 2014 | Hardware, API Calls, App components, Intents, Permissions and Network addresses | SVM | Drebin | 93.9 |
| Fereidooni et al. [41] | 2016 | Intent, API Calls and Permissions | SVM, DT, NB, LR, RF, KNN, Adaboost, DL, XGboost | Genome, Drebin, Virus Total | 97 |
| Karbab et al. [42] | 2016 | Binary, Assembly, Manifest and APK | Permissions, API calls, Network addresses, APK | Drebin, Genome | 87 |
| Mariconti et al. [43] | 2017 | API Calls | Markov Chain Model | Drebin | 87 |
| Feizollah et al. [44] | 2017 | Intents and Permissions | Bayesian Network | Drebin, Google PlayStore | 95.5 |
| Wang et al. [13] | 2017 | App components, Intents, Permissions, API calls, strings, commands and network information | Dempster-Shafer theory based fusion of KNN, random forest and J48 classifiers | Drebin and Android Malware Genome Project | 99.7 |
| Garcia et al. [45] | 2018 | Permissions, App Components and Intent filters | SVM | Malgenome, Drebin, Virus Share and Virus Total | |
| Garcia et al. [45] | 2018 | Permissions, App Components and Intent filters | SVM | Malgenome, Drebin, Virus Share and Virus Total | 86 [OD] |
| Machiry et al. [46] | 2018 | Code loops | RF | Malgenome and Virus Share | 99.1 |
| Alshahrani et al. [47] | 2018 | Permissions, system information, system calls, network information | SGD, RMSProp, Adagrad, Adam, Nadam, Adadelta and Adamax | Drebin and MARVIN | 95.13 |
| Alazab [48] | 2020 | API Calls | Naive Bayes, kNN, RF, J48, SMO, Logistic Regressions, Adaboost, JRip, Random committee, Simple logistics | VirusTotal, AndroZoo, MalShare, Contagio and Google PlayStore | 98.1 |
| Millar et al. [40] | 2020 | Opcode instructions, permissions, API calls and commands | DAN, CNN, Neural Nets | Drebin and self obfuscated | 97.3 |
| Millar et al. [40] | 2020 | Opcode instructions, permissions, API calls and commands | DAN, CNN, Neural Nets | Drebin and self obfuscated | 59.6 [OD] |
| Sihag et al. (Proposed Work) | 2020 | Opcode instructions | k-NN, J48, RF and SMO | Drebin, Contagio, Malgenome, PRAGuard | 98.6 |
| Sihag et al. (Proposed Work) | 2020 | Opcode instructions | k-NN, J48, RF and SMO | Drebin, Contagio, Malgenome, PRAGuard | 92.47 [OD] |

6. Related Works

Android's popularity makes it a prime target for malware authors. There are several studies on the obfuscation techniques used by Android malware and on the evolving methods to detect them.

Obfuscation and its effectiveness

Obfuscation methods are the new normal for both developers and malware authors. Tam et al. [12], Nigam [49] and Suarez-Tangil and Stringhini [50] have extensively discussed the evolution of Android malware over the last decade. Apvrille and Nigam [25] explore the practical usage of stealth techniques by Android malware. Faruki et al. [16] discuss obfuscation methods, application protection and deobfuscation methods specific to Android.

Dong et al. [22] provided insight into Android code obfuscation and carried out a large-scale investigation of its usage on 114,560 samples. Various static and dynamic code obfuscation approaches, such as renaming, string encryption, control flow obfuscation and reflection, are presented in [22, 51, 52, 53, 54]. The effectiveness of these obfuscation methods is evaluated in [55, 56, 4, 23, 57, 58, 59, 60, 61]. Park et al. [58] empirically analyzed application similarity between original software and its counterpart transformed by code obfuscation, and also questioned the legality of the obfuscated app. State-of-the-art deobfuscation methods are proposed in [62, 63, 64].
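To illustrate why identifier renaming, one of the obfuscation approaches listed above, disturbs string-based features while leaving opcode mnemonics intact, consider the following minimal sketch. The smali-like lines, the class and method names and the `opcodes` helper are hypothetical and heavily simplified; they are not taken from any dataset discussed here.

```python
from typing import List


def opcodes(lines: List[str]) -> List[str]:
    """Return the opcode mnemonic (first token) of each non-empty instruction line."""
    return [line.split()[0] for line in lines if line.strip()]


# Hypothetical smali-like method body before renaming.
original = [
    'const-string v1, "5551234"',
    "invoke-virtual {v0, v1}, Lcom/example/SmsHelper;->send(Ljava/lang/String;)V",
    "return-void",
]

# The same body after identifier renaming: only class and method names change.
renamed = [
    'const-string v1, "5551234"',
    "invoke-virtual {v0, v1}, La/b;->a(Ljava/lang/String;)V",
    "return-void",
]

print(original == renamed)                    # full instruction text differs
print(opcodes(original) == opcodes(renamed))  # opcode sequence is unchanged
```

Features built on identifier strings see two different methods here, whereas a feature built only on the opcode sequence sees the same one.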

Detection using Opcodes

Opcodes, which represent application code at the instruction level, are a popular basis for static analysis. Statistical properties of application opcodes are useful for malware detection, and multiple studies have evaluated their effectiveness for classification. Hang et al. [65] propose a simplification of 218 Dalvik opcodes and report it to be more effective than anti-malware software. Chen et al. [66] also perform simplification, but only of 107 representative opcodes. Canfora et al. [67] divided opcodes into n-grams for detection, using frequency characteristics that are then fed into SVM and RF classifiers. They concluded that the n-gram approach with n=2 was the most accurate for malware detection. Hahn et al. [68] included both opcode sequence and opcode frequency for classification using machine learning (Bayesian Network, k-NN and Random Forest). McLaughlin et al. [69] employed a CNN for deep learning on opcode sequences and concluded that it is more effective than the n-gram approach when scalability is considered. Other approaches have also employed similarity measures on opcode sequences or n-grams for classification [70, 71].
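The opcode n-gram frequency features used in several of the works above can be sketched as follows. This is a generic illustration and not the exact feature extraction of any cited work; the function names and the simplified opcode list are assumptions.

```python
from collections import Counter
from typing import Dict, List, Tuple

NGram = Tuple[str, ...]


def opcode_ngrams(ops: List[str], n: int = 2) -> Counter:
    """Count every contiguous run of n opcodes in the sequence."""
    return Counter(tuple(ops[i:i + n]) for i in range(len(ops) - n + 1))


def relative_frequencies(counts: Counter) -> Dict[NGram, float]:
    """Normalize n-gram counts so they sum to 1 (empty dict if there are no n-grams)."""
    total = sum(counts.values())
    return {gram: c / total for gram, c in counts.items()} if total else {}


# Hypothetical, simplified opcode sequence of one method.
ops = ["const", "invoke", "move", "const", "invoke", "return"]

bigrams = opcode_ngrams(ops, n=2)
freqs = relative_frequencies(bigrams)
for gram, freq in sorted(freqs.items()):
    print(gram, round(freq, 2))
```

Each application would then be represented by a vector of such frequencies over a fixed n-gram vocabulary before being passed to a classifier.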

7. Conclusion

Malware detection and classification is a complex problem that involves identifying and selecting distinct features from malware samples. The task becomes more complicated when malware employs obfuscation methods to evade such identification. This paper introduces BLADE, a novel system based on the Opcode Segment Document (OSD) for malware detection and familial classification. It is effective, accurate and resilient to obfuscation. BLADE relies on opcode segments, which represent sequential instructions. We evaluated it against research questions on malware detection, malware familial classification, malware class/type detection, obfuscation type detection and familial classification of obfuscated samples. BLADE was tested on the benchmark datasets AndroAutopsy, AndroTracker, Drebin and Android PRAGuard. It is found to be effective in detecting samples that use multiple obfuscation techniques.

As part of future work, we need to explore obfuscation methods in which malicious code is located outside the DEX file, such as in native code and libraries. Furthermore, we plan to explore the behavioral representation of fine-grained opcode segments against the behavioral abstraction obtained from dynamic analysis.

References

[1] Number of smartphone users worldwide from 2016 to 2021. URL https://www.statista.com/statistics/330695/number-of-smartphone-users-worldwide/

[2] N. Grover, J. Saxena, V. Sihag, Security Analysis of OnlineCabBooking Android Application, 2017, pp. 603–611.

[3] O. Alrawi, C. Zuo, R. Duan, R. P. Kasturi, Z. Lin, B. Saltaformaggio, The betrayal at cloud city: An empirical analysis of cloud-based mobile backends, in: 28th USENIX Security Symposium (USENIX Security 19), 2019, pp. 551–566.

[4] M. Dalla Preda, F. Maggi, Testing android malware detectors against code obfuscation: a systematization of knowledge and unified methodology, Journal of Computer Virology and Hacking Techniques 13 (3) (2017) 209–232.

[5] V. Sihag, M. Vardhan, P. Singh, A survey of android application and malware hardening, Computer Science Review 39 (2021) 100365.

[6] A. Bacci, A. Bartoli, F. Martinelli, E. Medvet, F. Mercaldo, Detection of obfuscation techniques in android applications, in: Proceedings of the 13th International Conference on Availability, Reliability and Security, 2018, pp. 1–9.

[7] H.-J. Zhu, Z.-H. You, Z.-X. Zhu, W.-L. Shi, X. Chen, L. Cheng, Droiddet: effective and robust detection of android malware using static analysis along with rotation forest model, Neurocomputing 272 (2018) 638–646.

[8] Y. Feng, S. Anand, I. Dillig, A. Aiken, Apposcopy: Semantics-based detection of android malware through static analysis, in: Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2014, pp. 576–587.

[9] H. Kang, J.-w. Jang, A. Mohaisen, H. K. Kim, Detecting and classifying android malware using static analysis along with creator information, International Journal of Distributed Sensor Networks 11 (6) (2015) 479174.

[10] J.-w. Jang, H. K. Kim, Function-oriented mobile malware analysis as first aid, Mobile Information Systems 2016.

[11] J. Li, L. Sun, Q. Yan, Z. Li, W. Srisa-An, H. Ye, Significant permission identification for machine-learning-based android malware detection, IEEE Transactions on Industrial Informatics 14 (7) (2018) 3216–3225.

[12] K. Tam, A. Feizollah, N. B. Anuar, R. Salleh, L. Cavallaro, The evolution of android malware and android analysis techniques, ACM Computing Surveys (CSUR) 49 (4) (2017) 1–41.

[13] X. Wang, D. Zhang, X. Su, W. Li, Mlifdect: android malware detection based on parallel machine learning and information fusion, Security and Communication Networks 2017.

[14] V. Sihag, A. Swami, M. Vardhan, P. Singh, Signature based malicious behavior detection in android, in: International Conference on Computing Science, Communication and Security, Springer, 2020, pp. 251–262.

[15] P. Sharma, V. K. Sihag, Hybrid Single Sign-On Protocol for Lightweight Devices, 2016, pp. 679–684.

[16] P. Faruki, H. Fereidooni, V. Laxmi, M. Conti, M. Gaur, Android code protection via obfuscation techniques: past, present and future directions, CoRR abs/1611.10231.

[17] Y. Zhou, X. Jiang, Dissecting android malware: Characterization and evolution, in: 2012 IEEE symposium on security and privacy, IEEE, 2012, pp. 95–109.

[18] J. G. de la Puerta, B. Sanz, I. Santos, P. G. Bringas, Using dalvik opcodes for malware detection on android, in: International Conference on Hybrid Artificial Intelligence Systems, Springer, 2015, pp. 416–426.

[19] A. Bartel, J. Klein, Y. Le Traon, M. Monperrus, Dexpler: Converting android dalvik bytecode to jimple for static analysis with soot, in: Proceedings of the ACM SIGPLAN International Workshop on State of the Art in Java Program Analysis, SOAP '12, Association for Computing Machinery, New York, NY, USA, 2012, pp. 27–38. doi:10.1145/2259051.2259056. URL https://doi.org/10.1145/2259051.2259056

[20] A. Desnos, et al., Androguard: Reverse engineering, malware and goodware analysis of android applications... and more (ninja!), Retrieved June 10 (2011) 2014.

[21] R. Winsniewski, Android–apktool: A tool for reverse engineering android apk files, Retrieved February 10 (2012) 2020.

[22] S. Dong, M. Li, W. Diao, X. Liu, J. Liu, Z. Li, F. Xu, K. Chen, X. Wang, K. Zhang, Understanding android obfuscation techniques: A large-scale investigation in the wild, in: International Conference on Security and Privacy in Communication Systems, Springer, 2018, pp. 172–192.

[23] D. Maiorca, D. Ariu, I. Corona, M. Aresu, G. Giacinto, Stealth attacks: An extended insight into the obfuscation effects on android malware, Computers & Security 51 (2015) 16–31.

[24] H. Cho, J. H. Yi, G.-J. Ahn, Dexmonitor: Dynamically analyzing and monitoring obfuscated android applications, IEEE Access 6 (2018) 71229–71240.

[25] A. Apvrille, R. Nigam, Obfuscation in android malware, and how to fight back, Virus Bulletin (2014) 1–10.

[26] B. Saikoa, Allatori java obfuscator, [Accessed: 09-Apr-2020]. URL http://www.allatori.com/

[27] Apk protect: Android apk security protection (2013). URL https://sourceforge.net/projects/apkprotect/

[28] B. Saikoa, Dexguard.

[29] P. Solutions, Dasho: Java & android obfuscator & runtime protection.

[30] L. Licel, Dexprotector–cutting edge obfuscator for android apps, [Accessed: 09-Apr-2020]. URL https://dexprotector.com/

[31] Mobile protector by gemalto, a thales company. URL https://thales-protector-oath-sdk.docs.stoplight.io/releases/5.2.0/general/overview

[32] G. Square, Proguard, [Accessed: 09-Apr-2020]. URL https://www.guardsquare.com/en/products/proguard

[33] Promon shield - in-app protection & application shielding. URL https://promon.co

[34] L. Licel, Stringer java obfuscator, [Accessed: 09-Apr-2020]. URL https://jfxstore.com/stringer/

[35] K. P. Murphy, Machine learning: a probabilistic perspective, MIT press, 2012.

[36] J. wook Jang, H. Kang, J. Woo, A. Mohaisen, H. K. Kim, Andro-autopsy: Anti-malware system based on similarity matching of malware and malware creator-centric information, Digital Investigation 14 (2015) 17–35. doi:10.1016/j.diin.2015.06.002.

[37] H. J. Kang, J.-w. Jang, A. Mohaisen, H. K. Kim, Androtracker: Creator information based android malware classification system, in: Information Security Applications-15th International Workshop, WISA, Vol. 8909, 2014.

[38] D. Arp, M. Spreitzenbarth, M. Hubner, H. Gascon, K. Rieck, C. Siemens, Drebin: Effective and explainable detection of android malware in your pocket., in: Ndss, Vol. 14, 2014, pp. 23–26.

[39] M. Parkour, Contagio mobile - mobile malware mini dump (2012). URL http://contagiominidump.blogspot.com

[40] S. Millar, N. McLaughlin, J. Martinez del Rincon, P. Miller, Z. Zhao, Dandroid: A multi-view discriminative adversarial network for obfuscated android malware detection, in: Proceedings of the Tenth ACM Conference on Data and Application Security and Privacy, 2020, pp. 353–364.

[41] H. Fereidooni, M. Conti, D. Yao, A. Sperduti, Anastasia: Android malware detection using static analysis of applications, in: 2016 8th IFIP international conference on new technologies, mobility and security (NTMS), IEEE, 2016, pp. 1–5.

[42] E. B. Karbab, M. Debbabi, A. Derhab, D. Mouheb, Cypider: building community-based cyber-defense infrastructure for android malware detection, in: Proceedings of the 32nd Annual Conference on Computer Security Applications, 2016, pp. 348–362.

[43] E. Mariconti, L. Onwuzurike, P. Andriotis, E. De Cristofaro, G. Ross, G. Stringhini, Mamadroid: Detecting android malware by building markov chains of behavioral models, arXiv preprint arXiv:1612.04433.

[44] A. Feizollah, N. B. Anuar, R. Salleh, G. Suarez-Tangil, S. Furnell, Androdialysis: Analysis of android intent effectiveness in malware detection, Computers & Security 65 (2017) 121–134.

[45] J. Garcia, M. Hammad, S. Malek, Lightweight, obfuscation-resilient detection and family identification of android malware, ACM Transactions on Software Engineering and Methodology (TOSEM) 26 (3) (2018) 1–29.

[46] A. Machiry, N. Redini, E. Gustafson, Y. Fratantonio, Y. R. Choe, C. Kruegel, G. Vigna, Using loops for malware classification resilient to feature-unaware perturbations, in: Proceedings of the 34th Annual Computer Security Applications Conference, 2018, pp. 112–123.

[47] H. Alshahrani, H. Mansourt, S. Thorn, A. Alshehri, A. Alzahrani, H. Fu, Ddefender: Android application threat detection using static and dynamic analysis, in: 2018 IEEE International Conference on Consumer Electronics (ICCE), IEEE, 2018, pp. 1–6.

[48] M. Alazab, Automated malware detection in mobile app stores based on robust feature generation, Electronics 9 (3) (2020) 435.

[49] R. Nigam, A timeline of mobile botnets, Virus Bulletin, March.

[50] G. Suarez-Tangil, G. Stringhini, Eight years of rider measurement in the android malware ecosystem: evolution and lessons learned, arXiv preprint arXiv:1801.08115.

[51] F. C. Freiling, M. Protsenko, Y. Zhuang, An empirical evaluation of software obfuscation techniques applied to android apks, in: International Conference on Security and Privacy in Communication Networks, Springer, 2014, pp. 315–328.

[52] M. Kühnel, M. Smieschek, U. Meyer, Fast identification of obfuscation and mobile advertising in mobile malware, in: 2015 IEEE Trustcom/BigDataSE/ISPA, Vol. 1, IEEE, 2015, pp. 214–221.

[53] V. Rastogi, Y. Chen, X. Jiang, Catch me if you can: Evaluating android anti-malware against transformation attacks, IEEE Transactions on Information Forensics and Security 9 (1) (2013) 99–108.

[54] M. Zheng, P. P. Lee, J. C. Lui, Adam: an automatic and extensible platform to stress test android anti-virus systems, in: International conference on detection of intrusions and malware, and vulnerability assessment, Springer, 2012, pp. 82–101.

[55] T. Cho, H. Kim, J. H. Yi, Security assessment of code obfuscation based on dynamic monitoring in android things, IEEE Access 5 (2017) 6361–6371.

[56] J. Hoffmann, T. Rytilahti, D. Maiorca, M. Winandy, G. Giacinto, T. Holz, Evaluating analysis tools for android apps: Status quo and robustness against obfuscation, in: Proceedings of the Sixth ACM Conference on Data and Application Security and Privacy, 2016, pp. 139–141.

[57] D. Maier, T. Müller, M. Protsenko, Divide-and-conquer: Why android malware cannot be stopped, in: 2014 Ninth International Conference on Availability, Reliability and Security, IEEE, 2014, pp. 30–39.

[58] J. Park, H. Kim, Y. Jeong, S.-j. Cho, S. Han, M. Park, Effects of code obfuscation on android app similarity analysis., JoWUA 6 (4) (2015) 86–98.

[59] V. Balachandran, D. J. Tan, V. L. Thing, et al., Control flow obfuscation for android applications, Computers & Security 61 (2016) 72–93.

[60] V. Haupert, D. Maier, N. Schneider, J. Kirsch, T. Müller, Honey, i shrunk your app security: The state of android app hardening, in: International Conference on Detection of Intrusions and Malware, and Vulnerability Assessment, Springer, 2018, pp. 69–91.

[61] P. Faruki, A. Bharmal, V. Laxmi, M. S. Gaur, M. Conti, M. Rajarajan, Evaluation of android anti-malware techniques against dalvik bytecode obfuscation, in: 2014 IEEE 13th International Conference on Trust, Security and Privacy in Computing and Communications, IEEE, 2014, pp. 414–421.

[62] Z. Kan, H. Wang, L. Wu, Y. Guo, D. X. Luo, Automated deobfuscation of android native binary code, arXiv preprint arXiv:1907.06828.

[63] Y. Moses, Y. Mordekhay, Android app deobfuscation using static-dynamic cooperation, VB2018.

[64] B. Bichsel, V. Raychev, P. Tsankov, M. Vechev, Statistical deobfuscation of android applications, in: Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, 2016, pp. 343–355.

[65] D. Hang, N.-q. HE, H. Ge, L. Qi, M. ZHANG, Malware detection method of android application based on simplification instructions, The Journal of China Universities of Posts and Telecommunications 21 (2014) 94–100.

[66] T. Chen, Q. Mao, Y. Yang, M. Lv, J. Zhu, Tinydroid: a lightweight and efficient model for android malware detection and classification, Mobile information systems 2018.

[67] G. Canfora, A. De Lorenzo, E. Medvet, F. Mercaldo, C. A. Visaggio, Effectiveness of opcode ngrams for detection of multi family android malware, in: 2015 10th International Conference on Availability, Reliability and Security, IEEE, 2015, pp. 333–340.

[68] S. Hahn, M. Protsenko, T. Müller, Comparative evaluation of machine learning-based malware detection on android., Sicherheit 2016-Sicherheit, Schutz und Zuverlässigkeit.

[69] N. McLaughlin, J. Martinez del Rincon, B. Kang, S. Yerima, P. Miller, S. Sezer, Y. Safaei, E. Trickel, Z. Zhao, A. Doupé, et al., Deep android malware detection, in: Proceedings of the Seventh ACM on Conference on Data and Application Security and Privacy, 2017, pp. 301–308.

[70] V. Sihag, A. Mitharwal, M. Vardhan, P. Singh, Opcode n-gram based malware classification in android, in: 2020 Fourth World Conference on Smart Trends in Systems, Security and Sustainability (WorldS4), IEEE, 2020, pp. 645–650.

[71] A. Ali-Gombe, I. Ahmed, G. G. Richard III, V. Roussev, Opseq: Android malware fingerprinting, in: Proceedings of the 5th Program Protection and Reverse Engineering Workshop, 2015, pp. 1–12.

Smali opcode based Android Malware detection and Obfuscation Identification

Preprint

Full-text available

Oct 2023

The Android platform's open-source nature makes it a prime target for attackers seeking to exploit vulnerabilities. The practice of reverse engineering in Android applications further increases this vulnerability, creating a lucrative ground for exploitation and attack. Malware developers use various obfuscation techniques to protect applications from reverse engineering attempts. These same obfuscation techniques are utilized by malware creators to hide malicious code within the application's structure. Obfuscation introduces useless code and concealed features during feature extraction, making it difficult for conventional malware analysis methods to recognise the application and resulting in a high rate of false negatives. To address this, this paper introduces an innovative Smali opcode-based model, specifically designed to address the complexity of obfuscation techniques during both binary and familial classification. The core objective is to design a lightweight model capable of classifying malware and benign applications, alongside robust familial classification. Moreover, the model is also equipped to identify the specific obfuscation technique employed in a given malware application. We have meticulously implemented and rigorously evaluated the proposed model using two distinct datasets encompassing obfuscated and non-obfuscated samples. The experimental findings affirm the model's performance, surpassing existing state-of-the-art Android malware classifiers. Notably, the model achieves an impressive binary classification accuracy of 99.4\%.

Android ransomware detection using a novel hamming distance based feature selection

Article

Full-text available

Aug 2023

Ransomware is a serious cyberthreat for Android users, with devastating consequences for its victims. By locking or encrypting the targeted device, victims are often left unable to access their data, with attackers demanding payment in bitcoins in exchange for decryption. These attacks can occur across various sectors, including government, business, and health systems. Therefore, effective measures to mitigate this threat are critical. This paper proposes a novel hamming distance-based feature selection technique for detecting Android ransomware through static analysis. The detection approach involves four phases: feature extraction, binary feature vector generation, feature selection, and classification. A Python tool is used to automatically extract static features from Android applications, which are then processed for feature vector generation and selection. The effectiveness of the proposed technique is evaluated using various experiments, including machine learning and deep learning techniques. In addition, this article outlines a threat scenario of ransomware on the Android platform. The proposed system achieves a maximum detection accuracy of 99% with Random Forest and Decision Tree classifiers, surpassing state-of-the-art studies. Overall, the proposed technique offers an efficient solution for detecting Android ransomware, which could help prevent future attacks and reduce the impact of this serious cyberthreat.

A temporal analysis and evaluation of fuzzy hashing algorithms for Android malware analysis

Article

Jun 2024

AndroPack: A Hybrid Method To Detect Packed Android Malware With Ensemble Learning

Conference Paper

Apr 2024

Detecting Android Malware by Mining Enhanced System Call Graphs

Article

Full-text available

Apr 2024

The persistent threat of malicious applications targeting Android devices has been growing in numbers and severity. Numerous techniques have been utilized to defend against this thread, including heuristic-based ones, which are able to detect unknown malware. Among the many features that this technique uses are system calls. Researchers have used several representation methods to capture system calls, such as histograms. However, some information may be lost if the system calls as a feature is only represented as a 1-dimensional vector. Graphs can represent the interaction of different system calls in an unusual or suspicious way, which can indicate malicious behavior. This study uses machine learning algorithms to recognize malicious behavior represented in a graph. The system call graph was fed into machine learning algorithms such as AdaBoost, Decision Table, Naïve Bayes, Random Forest, IBk, J48, and Logistic regression. We further employ a series feature selection method to improve detection accuracy and eliminate computational complexity. Our experiment results show that the proposed method has reduced feature dimension to 91.95% and provides 95.32% detection accuracy.

Android Malware Detection and Identification Frameworks by Leveraging the Machine and Deep Learning Techniques: A Comprehensive Review

Article

Mar 2024

Towards a DeepMalOb Improvement in the Use of Formal Security Risk Analysis Methods

Conference Paper

Nov 2023

A Survey and Evaluation of Android-Based Malware Evasion Techniques and Detection Frameworks

Article

Full-text available

Jun 2023

Android platform security is an active area of research where malware detection techniques continuously evolve to identify novel malware and improve the timely and accurate detection of existing malware. Adversaries are constantly in charge of employing innovative techniques to avoid or prolong malware detection effectively. Past studies have shown that malware detection systems are susceptible to evasion attacks where adversaries can successfully bypass the existing security defenses and deliver the malware to the target system without being detected. The evolution of escape-resistant systems is an open research problem. This paper presents a detailed taxonomy and evaluation of Android-based malware evasion techniques deployed to circumvent malware detection. The study characterizes such evasion techniques into two broad categories, polymorphism and metamorphism, and analyses techniques used for stealth malware detection based on the malware’s unique characteristics. Furthermore, the article also presents a qualitative and systematic comparison of evasion detection frameworks and their detection methodologies for Android-based malware. Finally, the survey discusses open-ended questions and potential future directions for continued research in mobile malware detection.

Android malware analysis and detection: A systematic review

Article

Oct 2023
EXPERT SYST

Android malware has been emerged as a significant threat, which includes exposure of confidential information, misrepresentation of facts and execution of applications without the knowledge of the users. Malware analysis plays an essential role in dealing with the unlawful behaviour of such malicious applications. Android malware analysis involves examining and understanding malware behaviour and its characteristics. It also includes potential adversarial impacts on Android devices. This paper presents a quick understanding and a holistic view of malware detection and analysis. The current investigation conducted a systematic literature review (SLR) to recognize the salient shifts in malware detection by examining a range of scholarly journals and conference papers. The SLR investigated 99 articles published between the years 2018 and 2023. The key observation of this SLR is that static analysis is the most implemented approach for detecting Android malware; Apktool and Androguard are the most frequently used tools. This study also conceded that deep learning and machine learning models have more potential to analyse the malicious behaviour of malware. Certain challenges are faced in Android malware analysis, that is, obfuscation techniques, dynamic code loading, and issues related to experimented datasets. Further, this study focuses on the following areas: the definition of the sample set, data optimisation and processing, feature extraction, machine learning application, and classifier validation. This investigation differs from previous analyses of Android malware detection by emphasizing additional methods based on machine learning.

Obfuscated Malware Detection: Impacts on Detection Methods

Chapter

Sep 2023

Obfuscated malware poses a challenge to traditional malware detection methods as it uses various techniques to disguise its behavior and evade detection. This paper focuses on the impacts of obfuscated malware detection techniques using a variety of detection methods. Furthermore, this paper discusses the current state of obfuscated malware, the methods used to detect it, and the limitations of those methods. The impact of obfuscation on the effectiveness of detection methods is also discussed. An approach for the creation of advanced detection techniques based on machine learning algorithms is offered, along with an empirical examination of malware detection performance assessment to battle obfuscated malware. Overall, this paper highlights the importance of staying ahead of the constantly evolving threat landscape to safeguard computer networks and systems.

A survey of android application and malware hardening

Article

Feb 2021

In the age of increasing mobile and smart connectivity, malware poses an ever evolving threat to individuals, societies and nations. Anti-malware companies are often the first and only line of defense for mobile users. Driven by economic benefits, the quantity and complexity of Android malware are increasing, thus making them difficult to detect. Malware authors employ multiple techniques (e.g. code obfuscation, packaging and encryption) to evade static analysis (signature based) and dynamic analysis (behavior based) detection methods. In this article, we present an overview of Android and its state of the art security services. We then present an exhaustive and analytic taxonomy of Android malware hardening techniques available in the literature. Furthermore, we review and analyze the code obfuscation and preventive techniques used by malware to evade detection. Hardening mechanisms are also popular amongst application developers to fortify against reverse engineering. Based on our in-depth survey, we highlight the issues related to them and outline future directions. We believe there is a need to examine the effectiveness and efficiency of hardening techniques and their combination.

Opcode n-gram based Malware Classification in Android

Conference Paper

Jul 2020

Signature Based Malicious Behavior Detection in Android

Chapter

Jul 2020

Users’ security and privacy are of increasing concern with the popularity of Android and its applications. Apps of a malicious nature attempt to perform activities like information leakage and user profiling, detection of which is a challenge for security researchers. In this paper, we try to solve this problem by proposing a behavior based approach to detect the malicious nature of applications in Android. Events and behavioral activities of an application are used to generate a signature, which is then matched against a signature database for detection. Behavioral signatures are designed on the basis of information leakage attempts, jailbreak attempts, abuse of root privilege and access to critical permissions. 260 popular apps of different nature were evaluated in addition to 42 Android apps that were flagged as malicious by the Government of India. The proposed system shows promising results in detecting malicious behaviors. It also identifies the nature of the malicious activity exploited by an app.

Eight Years of Rider Measurement in the Android Malware Ecosystem

Article

Mar 2020

Despite the growing threat posed by Android malware, the research community is still lacking a comprehensive view of common behaviors and emerging trends in malware families active on the platform. Without such a view, researchers incur the risk of developing systems that only detect outdated threats, missing the most recent ones. In this paper, we conduct the largest measurement of Android malware behavior to date, analyzing over 1.2 million malware samples that belong to 1.28K families over a period of eight years (from 2010 to 2017). We aim at understanding how Android malware has evolved over time, focusing on repackaging malware. In this type of threat different innocuous apps are piggybacked with a malicious payload (rider), allowing inexpensive malware manufacturing. One of the main challenges posed when studying repackaged malware is slicing the app to split benign components apart from the malicious ones. To address this problem, we use differential analysis to isolate software components that are irrelevant to the campaign and study the behavior of malicious riders alone. Our analysis framework relies on collective repositories and recent advances in the systematization of intelligence extracted from multiple anti-virus vendors. We find that since its infancy in 2010, the Android malware ecosystem has changed significantly, both in the type of malicious activity performed by malware and in the level of obfuscation used to avoid detection. Finally, we discuss what our findings mean for Android malware detection research, highlighting areas that need further attention by the research community. In particular, we show that riders of malware families evolve over time. This reveals an important experimental bias in research works that rely on automated systems for family identification without considering variants.

Automated Malware Detection in Mobile App Stores Based on Robust Feature Generation

Article

Mar 2020

Moutaz Alazab

Many Internet of Things (IoT) services are currently tracked and regulated via mobile devices, making them vulnerable to privacy attacks and exploitation by various malicious applications. Current solutions are unable to keep pace with the rapid growth of malware and are limited by low detection accuracy, long discovery time, complex implementation, and high computational costs associated with the processor speed, power, and memory. Therefore, an automated intelligence technique is necessary for detecting apps containing malware and effectively predicting cyberattacks in mobile marketplaces. In this study, a system for classifying mobile marketplaces applications using real-world datasets is proposed, which analyzes the source code to identify malicious apps. A rich feature set of application programming interface (API) calls is proposed to capture the regularities in apps containing malicious content. Two feature-selection methods—Chi-Square and ANOVA—were examined in conjunction with ten supervised machine-learning algorithms. The detection accuracy of each classifier was evaluated to identify the most reliable classifier for malware detection using various feature sets. Chi-Square was found to have a higher detection accuracy as compared to ANOVA. The proposed system achieved a detection accuracy of 98.1% with a classification time of 1.22 s. Furthermore, the proposed system required a reduced number of API calls (500 instead of 9000) to be incorporated as features.
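The feature-selection step mentioned in this abstract can be illustrated with a minimal sketch. It assumes binary features (whether an app calls a given API) and binary labels (malicious or benign), and scores each feature with the Pearson chi-square statistic of its 2x2 contingency table. The function names, API names and toy data are illustrative assumptions, not taken from the cited study, and library implementations may compute the score differently.

```python
def chi_square(feature, labels):
    """Pearson chi-square statistic for one binary feature against binary labels."""
    n = len(labels)
    observed = [[0, 0], [0, 0]]  # observed[feature_value][label]
    for f, y in zip(feature, labels):
        observed[f][y] += 1
    row_totals = [sum(observed[0]), sum(observed[1])]
    col_totals = [observed[0][0] + observed[1][0], observed[0][1] + observed[1][1]]
    stat = 0.0
    for i in (0, 1):
        for j in (0, 1):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:  # skip empty rows or columns
                stat += (observed[i][j] - expected) ** 2 / expected
    return stat


def select_top_k(features, labels, k):
    """Return the k feature names with the highest chi-square score."""
    scores = {name: chi_square(column, labels) for name, column in features.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Toy data: 1 = app calls the API, labels 1 = malicious (illustrative only).
labels = [1, 1, 1, 0, 0, 0]
features = {
    "sendTextMessage": [1, 1, 1, 0, 0, 0],  # perfectly associated with the label
    "getDeviceId": [1, 1, 0, 1, 0, 0],      # weakly associated
    "toString": [1, 1, 1, 1, 1, 1],         # constant, carries no information
}
print(select_top_k(features, labels, 2))
```

Reducing thousands of API-call features to the few hundred with the highest scores is the kind of pruning the abstract describes when it reports 500 features instead of 9000.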

Understanding Android Obfuscation Techniques: A Large-Scale Investigation in the Wild

Conference Paper

Aug 2018

Program code is a valuable asset to its owner. Due to the easy-to-reverse nature of Java, code protection for Android apps is of particular importance. To this end, code obfuscation is widely utilized by both legitimate app developers and malware authors, which complicates the representation of source code or machine code in order to hinder manual investigation and code analysis. Despite many previous studies focusing on obfuscation techniques, our knowledge of how obfuscation is applied by real-world developers is still limited. In this paper, we seek to better understand Android obfuscation and depict a holistic view of the usage of obfuscation through a large-scale investigation in the wild. In particular, we focus on three popular obfuscation approaches: identifier renaming, string encryption and Java reflection. To obtain meaningful statistical results, we designed efficient and lightweight detection models for each obfuscation technique and applied them to our massive APK datasets (collected from Google Play, multiple third-party markets, and malware databases). We have learned several interesting facts from the results. For example, more apps on third-party markets than malware use identifier renaming, and malware authors use string encryption more frequently. We are also interested in the explanation of each finding. Therefore we carry out in-depth code analysis on some Android apps after sampling. We believe our study will help developers select the most suitable obfuscation approach, and in the meantime help researchers improve code analysis systems in the right direction.

DexMonitor: Dynamically Analyzing and Monitoring Obfuscated Android Applications

Article

Nov 2018

Both Android application developers and malware authors use sophisticated obfuscation tools to prevent their mobile applications from being repackaged and analyzed. These tools obfuscate sensitive strings and classes, API calls, and control flows in the Dalvik bytecode. Consequently, security analysts inevitably spend a significant amount of time understanding the robustness of these obfuscation techniques and fully comprehending the intentions of each application. Since such analyses are often error-prone and require extensive analysis experience, it is critical to explore a novel approach to systematically analyze Android application bytecode. In this paper, we propose an approach to address this challenge by placing hooks in the Dalvik virtual machine at the point where a Dalvik instruction is about to be executed. We also demonstrate the effectiveness of our approach through case studies on real-world applications with our prototype, called DexMonitor.

TinyDroid: A Lightweight and Efficient Model for Android Malware Detection and Classification

Article

Oct 2018

With the popularity of Android applications, Android malware is growing exponentially. In order to detect Android malware effectively, this paper proposes a novel lightweight static detection model, TinyDroid, using instruction simplification and machine learning techniques. First, a symbol-based simplification method is proposed to abstract the opcode sequence decompiled from Android Dalvik Executable files. Then, N-gram is employed to extract features from the simplified opcode sequence, and a classifier is trained for the malware detection and classification tasks. To improve the efficiency and scalability of the proposed detection model, a compression procedure is also used to reduce features and select exemplars for the malware sample dataset. TinyDroid is compared against state-of-the-art real-world antivirus tools using the Drebin dataset. The experimental results show that TinyDroid achieves a higher accuracy rate and a lower false alarm rate with satisfactory efficiency.
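The two steps named in this abstract, symbol-based opcode simplification followed by N-gram extraction, can be sketched as follows. This is only an illustration under stated assumptions: the prefix-to-symbol table, the choice to drop unmapped opcodes, and the function names are hypothetical and not taken from the cited paper.

```python
from collections import Counter

# Hypothetical prefix-to-symbol table (an assumption for illustration).
SYMBOLS = {
    "move": "M", "return": "R", "goto": "G", "if": "I", "invoke": "V",
    "iget": "T", "sget": "T", "aget": "T",
    "iput": "P", "sput": "P", "aput": "P",
}


def simplify(opcodes):
    """Map each Dalvik opcode to a symbol by its prefix; drop unmapped opcodes."""
    symbols = []
    for op in opcodes:
        # "invoke-virtual/range" -> "invoke", "goto/16" -> "goto"
        prefix = op.split("-")[0].split("/")[0]
        symbol = SYMBOLS.get(prefix)
        if symbol is not None:
            symbols.append(symbol)
    return "".join(symbols)


def ngrams(sequence, n):
    """Count every contiguous n-gram in the simplified symbol string."""
    return Counter(sequence[i:i + n] for i in range(len(sequence) - n + 1))


method_opcodes = [
    "const/4", "invoke-virtual", "move-result-object",
    "if-eqz", "iget-object", "invoke-static", "return-void",
]
simplified = simplify(method_opcodes)
print(simplified)
print(ngrams(simplified, 2))
```

Collapsing opcode variants such as `invoke-virtual` and `invoke-static` into one symbol keeps the N-gram vocabulary small, which is presumably why such a model can stay lightweight.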

DANdroid: A Multi-View Discriminative Adversarial Network for Obfuscated Android Malware Detection

Conference Paper

Mar 2020

Using Loops For Malware Classification Resilient to Feature-unaware Perturbations

Conference Paper

Dec 2018

In the past few years, both the industry and the academic communities have developed several approaches to detect malicious Android apps. State-of-the-art research approaches achieve very high accuracy when performing malware detection on existing datasets. These approaches perform their malware classification tasks in an "offline" scenario, where malware authors cannot learn from and adapt their malicious apps to these systems. In real-world deployments, however, adversaries get feedback about whether their app was detected, and can react accordingly by transforming their code until they are able to influence the classification. In this work, we propose a new approach for detecting Android malware that is designed to be resilient to feature-unaware perturbations without retraining. Our work builds on two key ideas. First, we consider only a subset of the codebase of a given app, both for precision and performance aspects. For this paper, our implementation focuses exclusively on the loops contained in a given app. We hypothesize, and empirically verify, that the code contained in apps' loops is enough to precisely detect malware. This provides the additional benefits of being less prone to noise and errors, and being more performant. The second idea is to build a feature space by extracting a set of labels for each loop, and by then considering each unique combination of these labels as a different feature: The combinatorial nature of this feature space makes it prohibitively difficult for an attacker to influence our feature vector and avoid detection, without access to the specific model used for classification. We assembled these techniques into a prototype, called LoopMC, which can locate loops in applications, extract features, and perform classification, without requiring source code. We used LoopMC to classify about 20,000 benign and malicious applications. 
While focusing on a smaller portion of the program may seem counterintuitive, the results of these experiments are surprising: our system achieves a classification accuracy of 99.3% and 99.1% for the Malware Genome Project and VirusShare datasets respectively, which outperforms previous approaches. We also evaluated LoopMC, along with the related work, in the context of various evasion techniques, and show that our system is more resilient to evasion.
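The combinatorial feature space described in this abstract, where each unique combination of loop labels becomes its own feature, can be sketched as follows. The label names, the helper functions and the count-based vector are illustrative assumptions; the abstract does not specify how LoopMC encodes its features.

```python
from collections import Counter


def combination_features(loops):
    """Treat each loop's full label set as one feature and count occurrences.

    `loops` is a list of label sets, one per loop found in an app.
    """
    return Counter(tuple(sorted(labels)) for labels in loops)


def to_vector(features, vocabulary):
    """Project the counts onto a fixed, ordered vocabulary of label combinations."""
    return [features.get(combo, 0) for combo in vocabulary]


# Hypothetical labels attached to three loops of one app.
app_loops = [
    {"calls_network", "uses_crypto"},
    {"uses_crypto", "calls_network"},  # same combination, different order
    {"reads_file"},
]
features = combination_features(app_loops)
vocabulary = [("calls_network", "uses_crypto"), ("reads_file",), ("uses_crypto",)]
print(to_vector(features, vocabulary))
```

Because a loop labelled with both `calls_network` and `uses_crypto` maps to a different feature than a loop carrying either label alone, an attacker who adds or removes a single label shifts the loop to another feature without knowing which combinations the model weighs.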
