SECURITY AND COMMUNICATION NETWORKS
Security Comm. Networks 2015; 00:1–12
DOI: 10.1002/sec
RESEARCH ARTICLE
Color Images Steganalysis
Using RGB Channel Geometric Transformation Measures
Hasan ABDULRAHMAN 2,4, Marc CHAUMONT 1,2,3, Philippe MONTESINOS 4
and Baptiste MAGNIER 4
1Nîmes University, Place Gabriel Péri, 30000 Nîmes Cedex 1, France.
2Montpellier University, UMR5506-LIRMM, 34095 Montpellier Cedex 5, France.
3CNRS, UMR5506-LIRMM, 34392 Montpellier Cedex 5, France.
4Ecole des Mines d'Alès, LGI2P, Parc Scientifique G. Besse, 30035 Nîmes Cedex 1, France.
ABSTRACT
In recent years, information security has received a great deal of attention. To give an example, steganography techniques
are used to communicate in a secret and invisible way. Digital color images have become a good medium for digital
steganography due to their easy manipulation as carriers via the Internet, e-mail, or websites. The main goal of
steganalysis is to detect the presence of hidden messages in digital media. The proposed method is a further extension of
the authors' previous work: steganalysis based on color feature correlation and machine learning classification. Fusing
features with those obtained from Color Rich Models increases the detectability of hidden messages in color images. Our
new proposition uses two types of features, computed between color image channels. The first type of feature reflects
local Euclidean transformations and the second reflects mirror transformations. These geometric measures are obtained
from the sine and cosine of gradient angles between all the color channels. Features are extracted from co-occurrence
correlation matrices of these measures. We demonstrate the efficiency of the proposed framework on three steganography
algorithms designed to hide messages in images represented in the spatial domain: S-UNIWARD, WOW, and Synch-HILL. For
each algorithm, we applied a range of different payload sizes. The efficiency of the proposed method is demonstrated by
comparison with the authors' previous work, the Spatial Color Rich Model, and CFA-aware features for steganalysis.
Copyright © 2015 John Wiley & Sons, Ltd.
KEYWORDS
Steganalysis; Color Spatial Rich Model; CFA-aware steganalysis; channel correlation; ensemble classifier; steganography.
Received . . .
1. INTRODUCTION
Steganalysis, the art of detecting hidden information, has received a great deal of attention in recent years. Many researchers are working on solutions to ensure the detection of hidden messages inside digital media. As a result, many techniques and methods are currently used in the fields of steganography and steganalysis [1].
Modern information security techniques demonstrate that cryptography alone is not enough to ensure the safe communication of a hidden message. Indeed, it is simple to corrupt, sabotage or delete a file containing a secret/encrypted message, as such files may be tracked. In addition, the very presence of encrypted information is itself valuable information, and once an encrypted message is found, attempts at its decryption become possible. For these reasons, it is common to combine both approaches: the messages are encrypted, then hidden in a digital medium. Steganography is thus not intended to replace cryptography but to supplement it, making the detection of secret messages ever more difficult [2].
More specifically, steganography is the art of hiding the presence of a communication by embedding messages within media such as audio, image or video files, in a way that is hard to detect. The steganographer's objective is thus to hide the fact that there is information hidden in a medium [3].
Image steganography techniques based on modification of the medium are predominantly classified into the spatial and frequency domains [4]. In the spatial domain, pixel values are
Figure 1. Basic Steganography Model: the embedding process hides a secret message in a cover image to produce a stego image; the extraction process recovers the secret message.
used directly to embed the message bits. In the frequency domain [5], the frequency coefficients are used to embed the message. Each domain has several different algorithms. Generally, steganography consists of two parts: messages are embedded inside the digital medium in the first part (the embedding) and extracted in the second part (the extraction) [6], as illustrated in Fig. 1.
Embedding a message inside a digital medium involves slight changes to that medium: the coefficient values of the image are only slightly modified [18], and these changes are difficult for a common user to identify. Steganalysis research, on the other hand, aims to develop methods, theories and applications that can effectively detect these minor modifications and thus reveal hidden messages. Although the real world uses significantly more color images than grayscale ones, there is much more steganalysis research on grayscale images than on color images [7].
In this article, we describe further extensions of the recent method described by Abdulrahman et al. [8]. We propose new features to enhance the Color Rich Model [9], which is formed from co-occurrence matrices of residuals taken across the color channels.
The rest of this paper is organized as follows. Section 2 is dedicated to steganalysis methods for digital color images: Section 2.1 describes Color Spatial Rich Model steganalysis, and Section 2.2 describes Color Filter Array aware features for steganalysis [22]. We present a detailed description of our proposed method in Section 3, recalling the color channel correlation and the mirror transformations. The ensemble classifier used in this work is explained in Section 4. Experimental results and comparisons are given in Section 5. Finally, Section 6 gives some conclusions and perspectives.
2. RELATED WORK
In recent years, there have been a few techniques involving color steganalysis. In this regard, the earliest work was reported by Fridrich et al. [7]. The authors developed an influential approach for color steganalysis that detects stego images created by embedding a message inside randomly chosen pixels (using the Least Significant Bit (LSB) steganography method). They measured the relative number of close color pairs between the original image and the stego image. Let (R1, G1, B1) and (R2, G2, B2) denote the red, green and blue components of pixels from two color images. Two colored pixels (R1, G1, B1) and (R2, G2, B2) are considered close if the condition (R1 − R2)² + (G1 − G2)² + (B1 − B2)² ≤ 3 is satisfied. After the embedding process, the number of unique colors in the stego image is thus increased compared to the number of unique colors in the cover image.
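As an illustration, the closeness condition and the unique-color count can be sketched as follows (a minimal sketch, not Fridrich et al.'s implementation; the function names are ours):

```python
import numpy as np

def close_colors(p1, p2):
    """Closeness condition between two RGB pixels:
    (R1-R2)^2 + (G1-G2)^2 + (B1-B2)^2 <= 3."""
    p1 = np.asarray(p1, dtype=np.int64)
    p2 = np.asarray(p2, dtype=np.int64)
    return int(np.sum((p1 - p2) ** 2)) <= 3

def unique_color_count(img):
    """Number of unique RGB colors in an H x W x 3 image; LSB embedding
    tends to increase this count in the stego image."""
    flat = img.reshape(-1, 3)
    return len({tuple(px) for px in flat})
```

Counting unique colors before and after embedding then gives a simple detectability indicator in the spirit of [7].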
Fridrich et al. [10] introduced a reliable steganalysis algorithm to detect LSB embedding in randomly scattered, non-sequential pixels in both 24-bit color and grayscale images. In this method, the message length is derived by searching the lossless capacity in the LSB plane. Additionally, Westfeld and Pfitzmann [11] applied a robust statistical attack based on the analysis of Pairs of Values (PoVs) that are exchanged during message embedding. This method reliably detects stego images whose hidden messages were embedded in sequential pixels using the EZ Stego, S-tools, J-Steg, and Steganos methods. A set of PoVs is used to detect the presence of secret messages in digital images. However, this method is not efficient when messages are embedded in random pixels.
Ker [12] extended techniques for the detection of LSB Matching from grayscale images to color images. Starting from Harmsen's method [13], based on the Histogram Characteristic Function (HCF), Ker described two new approaches: the first calibrates the output Center Of Mass (COM) using a down-sampled image, and the second computes the adjacency histogram instead of the usual histogram to detect additive-noise-based steganography.
Thiyagarajan et al. [14] developed a steganalysis method based on color model conversion. To detect hidden messages in color images, they convert the Red (R), Green (G) and Blue (B) channels of the images to the Hue, Saturation and Intensity (HSI) color model. Stego images are generated in different color image formats using the least significant bit steganography method. Finally, cover and stego images are distinguished using a threshold value which depends on the correlation between pixel pairs in terms of color components.
Lyu et al. [15] described a steganalysis algorithm that exploits the inherent statistical regularities of original images. The statistical model consists of first- and higher-order color wavelet statistics of noise residuals, obtained using predictors of coefficients in a Quadrature Mirror Filter (QMF) decomposition of the image across all three color channels. Finally, they estimate that
the addition of color statistics provides considerable
improvement in overall detection accuracy.
Kirchner et al. [16] proposed a steganalysis method to detect LSB replacement steganography in color images. The authors enhanced the Weighted Stego (WS) image steganalysis method [17] by replacing the cover predictor in WS with position-specific predictors, to detect stego images produced from covers that exhibit traces of Color Filter Array (CFA) interpolation. This technique exploits the local predictability of pixels, depending on their position in the CFA interpolation, to compute the differences between cover images and stego images. The detector exploits only dependencies within a color channel due to color interpolation at cover generation.
Olguin-Garcia et al. [19] developed a new approach for color image steganalysis based on the Histogram Characteristic Function Center Of Mass (HCF-COM), detecting histogram changes in each of the R, G and B channels. The stego images are created using the LSB Matching steganography method. The Probability Density Function (PDF) is then computed to find an adequate threshold, and different threshold values are determined for different payloads.
The most recent and efficient methods in color image
steganalysis are explained in detail in the two following
sections.
2.1. Color Spatial Rich Model steganalysis
As is well known, embedding a message in an image modifies some pixel values. This modification slightly changes the pixel values where the message is embedded, which makes detecting and extracting sensitive features a difficult task. Many methods apply high-pass filters to the target image and then compute high-order statistics on the filtered images. Goljan et al. [9] introduced efficient color image features which are an extension of the Spatial Rich Model [18], produced from two different sets of features. First of all, this method extracts the noise residual from each color channel
separately. Let Xij denote a pixel value of an 8-bit grayscale cover image. For each of the red, green and blue channels of a color image, the noise residual is computed by the following formula:

Rij = X̂ij(Nij) − c · Xij,    (1)

where:
• c ∈ N is the residual order,
• Nij is a local neighborhood of the pixel Xij at coordinates (i, j),
• X̂ij(·) is a predictor of c · Xij, with Xij ∉ Nij and Xij ∈ {0, ..., 255}.
Many diverse submodels, built from differences between neighboring pixels, are combined in the Rich Model. All submodels (Rij) ∈ R^(n1×n2) are formed from noise residual images of size n1 × n2, computed using high-pass filters and then processed as follows:

Rij ← truncT(round(Rij / q)),    (2)

where:
• truncT(x) = x for x ∈ [−T, T], and truncT(x) = T · sign(x) otherwise,
• q is the quantization step,
• round(·) rounds to the nearest integer.
The Spatio-Color Rich Model consists of two different components. On the one hand, the Spatial Rich Model (SRMQ1) [18], with fixed quantization q = 1 and truncation T = 2, yields a dimensionality of 12753 features. These features are computed from each of the R, G and B color channels separately; the three feature sets are then added together to keep the same dimensionality as for grayscale images. On the other hand, from the same noise residuals (i.e. SRMQ1), CRMQ1 builds a collection of 3D color co-occurrence matrices, taking the three color values at the same position (across the three channels of each pixel). Thus, with fixed truncation T = 3 and quantization q = 1, CRMQ1 produces 5404 features per image.
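The residual computation of Eq. 1, here with a simple first-order predictor (c = 1, the right neighbour), followed by the quantization and truncation of Eq. 2, can be sketched as follows (an illustrative sketch, not the full SRMQ1 filter bank; the function names are ours):

```python
import numpy as np

def residual_horizontal(X):
    """First-order horizontal noise residual R_ij = X_{i,j+1} - X_ij:
    the right neighbour acts as the predictor of c * X_ij with c = 1."""
    X = X.astype(np.int64)
    return X[:, 1:] - X[:, :-1]

def quantize_truncate(R, q=1.0, T=2):
    """Quantize the residual by step q, round to an integer, then
    truncate to [-T, T] as in Eq. 2."""
    Rq = np.round(R / q).astype(np.int64)
    return np.clip(Rq, -T, T)
```

Co-occurrence matrices of these small integer values then form the submodels of the Rich Model.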
2.2. CFA-aware features steganalysis
Digital cameras capture color images using a single sensor in conjunction with a Color Filter Array (CFA). The CFA lets only one part of the spectrum through to the sensor, so that only one color is measured at each pixel (red, green or blue); the resulting images are called mosaic images. To construct a color image, a demosaicking algorithm is used to interpolate each color plane (i.e. CFA interpolation). Several patterns exist for the color filter array, the most common being the Bayer CFA [20]. In this pattern, the green color channel is the most important factor determining the luminance of the color image: 50% of the pixels in the Bayer CFA structure are assigned to the green channel, while 25% are assigned to the red channel and 25% to the blue channel [21].
Goljan et al. introduced in [22] the CFA-aware CRM for color image steganalysis. The features are made of two parts: the first is the Color Rich Model CRMQ1 explained in Section 2.1, with T ∈ {2, 3}; the second is the CFA-aware feature set, which consists of three combinations: the RB/GG split, the R/B/GG split and the NII/INI split.
Let X be a true-color image of size n1 × n2, where n1 and n2 are even numbers (0 ≤ i < n1, 0 ≤ j < n2). Considering a typical Bayer mosaic, the G sub-image has twice as many pixels as the R and B sub-images. We
must mention that all the color images used in this method are cropped from one pixel position, the upper left pixel, corresponding to a non-interpolated blue in the Bayer CFA. The color noise residuals Z = (z(R)ij, z(G)ij, z(B)ij) are computed as in Eq. 1, following the CFA map used.
First of all, the following four index sets must be generated:

XB = {(i, j) | i even, j even},
XG1 = {(i, j) | i odd, j even},
XG2 = {(i, j) | i even, j odd},
XR = {(i, j) | i odd, j odd}.

Four 3D co-occurrence matrices are computed from the residual samples on the above index sets:

C(B)_{d1 d2 d3} = Σ_{(i,j) ∈ XB} [ (z(R)ij, z(G)ij, z(B)ij) = (d1, d2, d3) ],    (3)

C(G1)_{d1 d2 d3} = Σ_{(i,j) ∈ XG1} [ (z(R)ij, z(G)ij, z(B)ij) = (d1, d2, d3) ],    (4)

C(G2)_{d1 d2 d3} = Σ_{(i,j) ∈ XG2} [ (z(R)ij, z(G)ij, z(B)ij) = (d1, d2, d3) ],    (5)

C(R)_{d1 d2 d3} = Σ_{(i,j) ∈ XR} [ (z(R)ij, z(G)ij, z(B)ij) = (d1, d2, d3) ],    (6)

where [·] is the Iverson bracket.
From the above four co-occurrence matrices, three combinations of features are generated to form, together with the CRMQ1 set, the total number of features.

The first combination, called the RB/GG split, generates 4146 features. C(R)_{d1 d2 d3} and C(B)_{d1 d2 d3} are symmetrized and added together, and the same is applied to C(G1)_{d1 d2 d3} and C(G2)_{d1 d2 d3}:

C(RB)_{d1 d2 d3} = C(B)_{d1 d2 d3} + C(B)_{d3 d2 d1} + C(R)_{d1 d2 d3} + C(R)_{d3 d2 d1},    (7)

C(GG)_{d1 d2 d3} = C(G1)_{d1 d2 d3} + C(G1)_{d3 d2 d1} + C(G2)_{d1 d2 d3} + C(G2)_{d3 d2 d1}.    (8)
The R/B/GG split is the second set and produces 10323 features. This part can be considered an important component of the method because it contributes a considerable number of features. It is generated from the concatenation of C(R)_{d1 d2 d3}, C(B)_{d1 d2 d3}, and C(G1)_{d1 d2 d3} + C(G2)_{d1 d2 d3}.

The third set corresponds to the NII/INI split, where 'N' means non-interpolated and 'I' interpolated in the RGB triple. The 'NII' pixels correspond to the same set as RB, but the two co-occurrence matrices are directionally symmetrized differently. This set generates 5514 features from two co-occurrence matrices:

C(NII)_{d1 d2 d3} = C(B)_{d3 d2 d1} + C(R)_{d1 d2 d3},    (9)

C(INI)_{d1 d2 d3} = C(GG)_{d1 d2 d3}.    (10)
All these features are gathered into a one-dimensional vector, and all detectors are trained as binary classifiers implemented using the Kodovsky ensemble classifiers [26], as explained in Section 4.
3. FEATURES DESCRIPTION
Our proposition is to enrich the SCRMQ1 with an inter-channel correlation composed of three sets of features. The first set, produced by [9], gives 18157 features. The second set, produced by our first method [8], gives 3000 features. The third set, produced by the second method, also gives 3000 features; these are obtained from the new correlation of the different R, G and B channel gradients, as shown in Table I.
Table I. Feature sets and their dimensionalities corresponding to q and T.

Feature set   SCRMQ1   CRG/CRB   SRG/SRB
Symmetry      yes      yes       yes
Dimension     18157    3000      3000
The following section recalls the RGB channel correlations on which our proposition is based; Section 4 then explains the ensemble classifiers used in this approach.
3.1. RGB Channel Correlation
In this section, we introduce an inter-channel correlation measure and demonstrate that it can be linked to first-order Euclidean invariants (see Hilbert [23] for invariant theory). Such invariants have mainly been used for stereo-matching [24]. In this paper, we show that the information they provide can enhance steganography detection. The underlying idea is that if one channel has been affected by steganography, the inter-channel correlation will measure the local modifications.
Starting from the local correlation of the red and green channels (the correlation of the red and blue channels is similar):

CorrR,G(i, j, k, l) = Σ_{(i′,j′) ∈ Wi,j} X(R)_{i′,j′} · X(G)_{k+i′, l+j′},    (11)
with:
Figure 2. Feature extraction: cosine of the gradient angles between the red, green and blue channels of a cover/stego image [8].
• X(R)_{i′,j′} ∈ [0, 255] is the pixel value at position (i′, j′) in the red channel,
• X(G)_{k,l} ∈ [0, 255] is the pixel value at position (k, l) in the green channel,
• Wi,j is a small window centered at (i, j).
Considering (k, l) = (0, 0) and a limited development of X(R) and X(G) around (i, j), then:

CorrR,G(i, j, 0, 0) = Σ_{h = (i′−i, j′−j), (i′,j′) ∈ Wi,j} (X(R)_{i,j} + ∇X(R)_{i,j} · h) (X(G)_{i,j} + ∇X(G)_{i,j} · h).    (12)
Developing this equation leads to four terms, three of which are constant or not informative; only one informative term remains:

∇X(R)_{i,j} · ∇X(G)_{i,j}.    (13)

If only one channel has been altered locally, the gradient of this channel is modified. Consequently, the scalar product of the two channel gradients reflects the change in the cosine of the difference between the two gradient angles.
Similarly, we can apply the same computation to the red and blue channels and obtain:

∇X(R)_{i,j} · ∇X(B)_{i,j}.    (14)

As stated by Gouet et al. [24] (and following the Hilbert theory [23]), it is unnecessary to investigate the ∇X(G)_{i,j} · ∇X(B)_{i,j} term, as it is already implicitly contained in the first two expressions (Eqs. 13 and 14).
Normalizing these expressions, we obtain the cosine of the rotation angles between channel gradients:

CRG = (∇X(R)_{i,j} · ∇X(G)_{i,j}) / (|∇X(R)_{i,j}| |∇X(G)_{i,j}|),    (15)

CRB = (∇X(R)_{i,j} · ∇X(B)_{i,j}) / (|∇X(R)_{i,j}| |∇X(B)_{i,j}|).    (16)

Fig. 2 illustrates our preprocessing steps [8] to obtain the cosine of the rotation angles between channel gradients. Note that the gradient derivatives of each channel are estimated by a convolution with a [-1, 1] mask (horizontal and vertical).
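A minimal sketch of the CRG/CRB cosine measure (Eqs. 15-16), using forward differences for the [-1, 1] gradient mask, could look as follows (the function names are ours and the border handling is simplified):

```python
import numpy as np

def gradient(X):
    """Horizontal and vertical derivatives via a [-1, 1] convolution mask
    (forward differences), cropped to a common (n1-1, n2-1) shape."""
    X = X.astype(np.float64)
    gx = X[:, 1:] - X[:, :-1]   # horizontal derivative
    gy = X[1:, :] - X[:-1, :]   # vertical derivative
    return gx[:-1, :], gy[:, :-1]

def cosine_measure(XA, XB, eps=1e-12):
    """Cosine of the rotation angle between the gradients of two channels
    (e.g. CRG for XA = red, XB = green), as in Eq. 15."""
    axh, axv = gradient(XA)
    bxh, bxv = gradient(XB)
    dot = axh * bxh + axv * bxv                # scalar product of gradients
    na = np.sqrt(axh ** 2 + axv ** 2)          # gradient norms
    nb = np.sqrt(bxh ** 2 + bxv ** 2)
    return dot / (na * nb + eps)
```

For two identical channels the measure is 1 everywhere; local embedding changes in one channel perturb it locally.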
3.2. Mirror transformations
In the preceding section, we have seen that the inter-channel correlation is linked to the scalar product of gradients (i.e. Euclidean invariants). This means that while we are able to measure the absolute value of a rotation angle between two channel gradients, we still need the direction of the rotation, which is linked this time to mirror transformations (as illustrated in Fig. 3).
Our proposition is to add two new feature sets based on the determinants of channel gradients. Similarly to the recent work of Abdulrahman et al. [8], the features are directly linked to the correlation, in order to obtain new features from the sine of the gradient angles. Finally, as illustrated in Fig. 4, we normalize these determinants by the gradient norms to obtain the sine of the rotations:

SRG = (∇X(R)_{i,j}[0] · ∇X(G)_{i,j}[1] − ∇X(R)_{i,j}[1] · ∇X(G)_{i,j}[0]) / (|∇X(R)_{i,j}| |∇X(G)_{i,j}|),    (17)

SRB = (∇X(R)_{i,j}[0] · ∇X(B)_{i,j}[1] − ∇X(R)_{i,j}[1] · ∇X(B)_{i,j}[0]) / (|∇X(R)_{i,j}| |∇X(B)_{i,j}|),    (18)

with ∇X[0] (resp. ∇X[1]) the first (resp. second) component of the vector ∇X, i.e. corresponding to the horizontal and vertical derivatives (see Fig. 4).
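The sine measure of Eqs. 17-18, i.e. the normalized determinant (2D cross product) of the two channel gradients, can be sketched in the same way (an illustrative sketch with simplified border handling; names are ours):

```python
import numpy as np

def sine_measure(XA, XB, eps=1e-12):
    """Sine of the rotation angle between the gradients of two channels
    (e.g. SRG for XA = red, XB = green), as in Eq. 17: the normalized
    determinant of the two gradient vectors."""
    XA = XA.astype(np.float64)
    XB = XB.astype(np.float64)
    # forward-difference gradients cropped to a common shape
    axh = (XA[:, 1:] - XA[:, :-1])[:-1, :]   # horizontal, channel A
    axv = (XA[1:, :] - XA[:-1, :])[:, :-1]   # vertical, channel A
    bxh = (XB[:, 1:] - XB[:, :-1])[:-1, :]   # horizontal, channel B
    bxv = (XB[1:, :] - XB[:-1, :])[:, :-1]   # vertical, channel B
    det = axh * bxv - axv * bxh              # determinant of the gradients
    na = np.sqrt(axh ** 2 + axv ** 2)
    nb = np.sqrt(bxh ** 2 + bxv ** 2)
    return det / (na * nb + eps)
```

Unlike the cosine, the sine is signed: it vanishes for identical gradients and its sign encodes the direction of the local rotation.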
3.3. Complete feature set
Our features are computed from the CRG, CRB, SRG and SRB correlations by computing co-occurrence matrices as in the Rich Model [18]. We used different values of the quantization q ∈ {0.1, 0.3, 0.5, 0.7, 0.9, 1} with a fixed truncation T = 1. The reason for using these different quantization values is that CRG, CRB, SRG and SRB belong to [−1, 1]. Moreover, the use of these values gives more accurate features and avoids the generation of too many zero values caused by the truncation step in the co-occurrence vector. For each quantization, we obtain 12 submodels from method 1 [8] and 12 submodels from the newly proposed method 2∗. The submodels from the Color Rich Model [9] give 18157 features, those of method 1 [8] give 3000 features, and those of our proposed method 2 give 3000 features. Accordingly, the final feature vector collects a set of 24157 features.

Figure 3. Rotation angle between two channel gradients: cos(θ1) = cos(θ2) but sin(θ1) = −sin(θ2). The sine is essential to determine the direction of the rotation.

Figure 4. Feature extraction: sine of the gradient angles, extracting information from the direction of the local rotation.
4. THE ENSEMBLE CLASSIFIERS
An ensemble of classifiers [25] is a set of classifiers whose individual decisions are combined, through weighted or unweighted votes, to classify the data (in this work, the data are the features detailed in the previous section).
Modern steganalysis methods for digital images are based on feature extraction and need machine learning techniques to decide whether a medium contains hidden messages or not. In our work, we chose ensemble classifiers [26] because of their efficient classification performance for large-scale learning.
Kodovsky et al. [26] proposed ensemble classifiers†, a machine learning tool for steganalysis,

∗For method 1 (resp. method 2) we use one symmetrized spam14h and one spam14v submodel, with 25 features each. We also use the minmax22h, minmax22v, minmax24, minmax34h, minmax34v, minmax41, minmax34, minmax48h, minmax48v, and minmax54 submodels, with 45 features each. All submodels are gathered into a one-dimensional vector, reaching a dimensionality of (2 × 25 + 10 × 45) × 6 = 3000 features. For more details on submodel construction, the reader is invited to look at article [18].
†The ensemble classifier is available at http://dde.binghamton.edu/download/ensemble.

consisting of L classifiers (Bl), trained independently, designed to keep complexity to a minimum and make the overall process simple.
Each base learner is trained on a randomly selected dsub-dimensional subspace of the full d-dimensional feature space. The authors use Fisher Linear Discriminants (FLD) as base learners, and the final decision is made by aggregating the decisions of the individual base learners. Let d be the full feature-space dimension, and Ntrn and Ntst the numbers of training and testing samples from each class. First, the classifier constructs L FLD base learners (Bl), with l ∈ {1, ..., L}. Each one performs its learning on a subspace of dimension dsub, with dsub << d. From the ith image, a feature vector fi ∈ R^d is extracted and then mapped as R^d → {0, 1}, where '0' stands for cover and '1' for stego.
In the learning phase, each classifier learns to map a feature vector fi to the correct class number:

FLDl : R^d → {0, 1}
       fi ↦ FLDl(fi).

Each classifier uses the training database to compute the vector orthogonal to the hyperplane separating the two classes. For a test feature, the lth base learner reaches its decision by computing a projection and comparing it to a threshold. After collecting all L decisions, the final classifier selects the class which has received the most votes. The decision threshold of each base learner is adjusted to minimize the total detection error under equal priors on the training data [26]:

PE = min_{PFA} (1/2) [PFA + PMD(PFA)],    (19)

where PFA represents the false-alarm probability and PMD the missed-detection probability.
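The random-subspace voting scheme can be sketched as follows; note this is a simplified illustration in which a nearest-class-mean rule stands in for the actual FLD projections of Kodovsky et al. [26], and all names are ours:

```python
import numpy as np

rng = np.random.default_rng(0)

def train_ensemble(F, y, L=11, d_sub=2):
    """Random-subspace ensemble sketch: each of the L base learners sees
    only d_sub randomly chosen dimensions of the feature matrix F
    (rows = images, labels y: 0 = cover, 1 = stego)."""
    learners = []
    for _ in range(L):
        idx = rng.choice(F.shape[1], size=d_sub, replace=False)
        m0 = F[y == 0][:, idx].mean(axis=0)   # cover class mean
        m1 = F[y == 1][:, idx].mean(axis=0)   # stego class mean
        learners.append((idx, m0, m1))
    return learners

def predict(learners, f):
    """Majority vote of the L base decisions on a single feature vector."""
    votes = sum(
        int(np.linalg.norm(f[idx] - m1) < np.linalg.norm(f[idx] - m0))
        for idx, m0, m1 in learners
    )
    return int(votes > len(learners) / 2)
```

In the real ensemble, each base learner is an FLD whose threshold is tuned to minimize PE (Eq. 19) before the votes are aggregated.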
5. EXPERIMENTAL RESULTS
5.1. Experimental setup and protocol
All our features are computed and gathered into a one-dimensional vector for 10000 color cover and 10000 color stego images, for each payload of the steganography methods. These features are then fed to the classifier. The classifiers were implemented using the ensemble classifier [26] with FLD base learners. In this paper, the detection accuracy is measured by the average total probability of testing error under equal priors, as in Eq. 19. 5000 images from the database are randomly chosen for the training set and 5000 for the testing set. The ensemble classifiers apply a vote to estimate the detection error. This process is repeated 10 times to obtain P̄E, the average testing error. P̄E quantifies the detectability and is collected for each method and payload to evaluate the steganalysis method. Given the decision values, ROC curves are obtained. As illustrated in Fig. 8, the area under the ROC curve is used as the accuracy of the ensemble classifiers.
5.1.1. Image Dataset
A raw image is a computer file containing untouched pixel information coming from the digital camera sensor (i.e. the pure information). These files hold a large amount of meta-information about the image generated by the camera [27].
In our work, the color image database is built very carefully following the CFA idea. We collected raw images from two of the most standard subsets with the highest numbers of captured images: the Dresden Image Database [28], with 3500 full-resolution Nikon raw color images, and the Break Our Steganographic System base (BOSSbase‡), with 1000 Canon raw color images.
In order to obtain color images in Portable Pixel Map (PPM) format of size 512×512, all images must share the same CFA map layout, as illustrated in Fig. 6. This process requires two steps. The first step uses a demosaicking algorithm to convert raw images into demosaicked images. The second step crops five areas from each image. Fig. 5 shows sample images produced by the cropping step.
First, we used the Patterned Pixel Grouping (PPG) demosaicking algorithm from the dcraw software§ to convert raw images into RGB images. As illustrated in Fig. 6, the obtained images are such that the Bayer pattern is always of type RGBG (a red channel pixel is placed at an even position). We wrote a special code to start the crop from a red channel position. Indeed, from one image, this code randomly selected a red channel position and
‡BOSSbase can be accessed at http://www.agents.cz/boss/BOSSFinal.
§The dcraw code is available at http://www.cybercom.net/defin/dcraw.
Figure 5. Sample images of our database, built by random cropping from locations of red channel pixels (even positions) in a Bayer pattern:
a) original raw image, 3906×2602;
b) crop 1, position x=2116, y=1928;
c) crop 2, position x=902, y=1182;
d) crop 3, position x=3080, y=436;
e) crop 4, position x=1866, y=1778;
f) crop 5, position x=650, y=1032.
cropped five images of size 512×512 pixels, so that all blocks share the same CFA map layout. The final database contains 10000 RGB color images of size 512×512.
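The Bayer-aligned cropping step can be sketched as follows (a hypothetical helper illustrating the idea of snapping crop corners to even, red-pixel positions so that all crops share the same CFA map; the function name is ours):

```python
import numpy as np

def crop_bayer_aligned(img, x, y, size=512):
    """Crop a size x size block whose top-left corner sits at an even
    (red-pixel) position of an RGBG Bayer layout: the requested corner
    (x, y) is snapped down to the nearest even coordinates."""
    x -= x % 2   # force an even column (red channel position)
    y -= y % 2   # force an even row
    return img[y:y + size, x:x + size]
```

Because every crop starts on an even (row, column) pair, all resulting blocks share the same CFA map layout.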
5.1.2. Embedding methods
The stego images are obtained using three spatial-domain steganography algorithms. The first is the Spatial-UNIversal WAvelet Relative Distortion (S-UNIWARD¶) algorithm [29]. The second is the Wavelet Obtained Weights (WOW‖) algorithm [30]. Finally, the third is the Synchronizing the Selection Channel (Synch-HILL∗∗) algorithm [31].
These algorithms are used to embed messages into color images by treating the R, G and B channels as three grayscale images and embedding the same proportional payload into each channel. Different payload sizes are tested: {0.01, 0.05, 0.1, 0.2, 0.3, 0.4, 0.5} Bits Per Channel (bpc).
¶The S-UNIWARD steganography method is available at http://dde.binghamton.edu/download/stego_algorithms/.
‖The WOW steganography method is available at http://dde.binghamton.edu/download/stego_algorithms/.
∗∗The Synch-HILL steganography method is available at http://dde.binghamton.edu/download/stego_algorithms/.
Figure 6. The preprocessing steps for building our database following the CFA idea: raw images from the raw image database are converted with the dcraw code, then our crop code starts each crop from a red channel position of the CFA Bayer pattern to build the PPM-format color image database.
5.2. Results and Discussion
This section presents the experimental results of our proposed method, summarized in Table II. The S-UNIWARD, WOW and Synch-HILL methods were tested with different relative payloads {0.01, 0.05, 0.1, 0.2, 0.3, 0.4, 0.5} (bpc) against three approaches: method 1 [8], the Color Rich Model [9] and the CFA-aware features steganalysis [22]. We used the same set of payload values with the same embedding methods. Our proposed second method, which uses both the sine and cosine of the gradient angles, achieved the highest performance, with 88.76%, 87.93% and 88.07% detection rates for S-UNIWARD, WOW and Synch-HILL respectively (at payload 0.5 bpc). The Color Rich Model method [9] is less efficient, achieving respectively 86.14%, 85.27% and 85.25% detection rates. The CFA-aware features method [22] is also less efficient, achieving respectively 87.61%, 87.04% and 87.42% detection rates. Close to the CFA-aware features method, the method of Abdulrahman et al. [8] achieved respectively 87.54%, 86.63% and 86.77% detection rates. We noted the same trend for the remaining payload values, as shown in Table II.
Additionally, Table II shows that the method of
Abdulrahman et al. [8], which uses only the cosine of the
gradient angles, already outperforms the Color Rich Model
[9] on the same test samples, and that our second proposed
method, which adds the sine, in turn outperforms the
CFA-aware features steganalysis [22] at every payload.
Moreover, the curves in Fig. 7(a), (b) and (c), for the
S-UNIWARD, WOW and Synch-HILL steganography methods
respectively, illustrate the comparison between the
proposed second method and the competing methods. The
average testing error of the second method is lower than
that of the first proposition, the Color Rich Model and
the CFA-aware features method, which demonstrates the
value of the additional 3000 features introduced by the
second method.
Another experiment embedded the entire payload in only
one channel of the color image, i.e. 0.2 bpc and 0.4 bpc
in the green channel only. In this case, the detection
rate is higher than for the same payload distributed
equally over the three color channels. Table III compares
the detection rates for the S-UNIWARD, WOW and Synch-HILL
methods with payloads of 0.2 bpc and 0.4 bpc embedded in
one channel only versus in the three channels separately.
Fig. 8(a), (b) and (c) show the corresponding ROC curves
for our method 2. This experiment reveals that a hidden
message confined to a single channel is easier to detect
than one spread across all channels.
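The two quantities reported in Tables II and III are linked by P_D% = (1 - P_E) x 100. A minimal sketch of how they can be computed from classifier scores, assuming larger scores mean "stego" and equal priors (the actual Ensemble Classifier estimates P_E differently, e.g. from out-of-bag errors, so this is illustrative only):

```python
import numpy as np

def average_error_and_detection(cover_scores, stego_scores):
    """Average testing error P_E = min over thresholds of
    (P_FA + P_MD) / 2 under equal priors, and the detection
    rate P_D = (1 - P_E) * 100, as reported in Table II."""
    best = 0.5  # trivially achievable by always answering "cover"
    for t in np.concatenate([cover_scores, stego_scores]):
        p_fa = np.mean(cover_scores >= t)  # false alarms on covers
        p_md = np.mean(stego_scores < t)   # missed detections on stegos
        best = min(best, 0.5 * (p_fa + p_md))
    return best, 100.0 * (1.0 - best)
```

On perfectly separated scores this returns P_E = 0 and P_D = 100%; overlapping score distributions push P_E toward 0.5.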
Table III. Detection rates of our proposed method 2 for the S-UNIWARD, WOW and Synch-HILL steganography methods at 0.2 bpc and 0.4 bpc, with the payload embedded in the green channel only (G%) compared with equal embedding in the three channels (RGB%).

            S-UNIWARD       WOW             Synch-HILL
Payload     G%     RGB%     G%     RGB%     G%     RGB%
0.2         90.02  78.09    88.51  76.19    89.23  77.31
0.4         96.77  87.11    94.83  86.16    94.87  86.89
6. CONCLUSION
In this paper, we have proposed new features for
steganalysis of color images. Starting from the Color Rich
Model proposed by Goljan et al. [9], we have shown
that this method could be greatly enhanced by considering
Table II. Numerical values of the average testing error P_E and the detection rate P_D% for the three steganography methods, for the Color Rich Model [9], the CFA-aware features [22], the first method of Abdulrahman et al. [8] (method 1) and the second proposed method (method 2).

            Color Rich      CFA-Aware       Method 1        Method 2
Payload     P_E    P_D%     P_E    P_D%     P_E    P_D%     P_E    P_D%
S-UNIWARD
0.01        0.4841 51.59    0.4863 51.37    0.4830 51.70    0.4680 53.20
0.05        0.4045 59.55    0.4072 59.28    0.4010 59.90    0.3859 61.41
0.1         0.3298 67.02    0.3194 68.06    0.3203 67.97    0.3037 69.63
0.2         0.2498 75.02    0.2317 76.83    0.2370 76.30    0.2191 78.09
0.3         0.1947 80.53    0.1806 81.94    0.1808 81.92    0.1623 83.77
0.4         0.1599 84.01    0.1429 85.71    0.1470 85.30    0.1289 87.11
0.5         0.1386 86.14    0.1239 87.61    0.1246 87.54    0.1124 88.76
WOW
0.01        0.4850 51.50    0.4875 51.25    0.4836 51.64    0.4753 52.47
0.05        0.4092 59.08    0.4174 58.26    0.4042 59.58    0.3906 60.94
0.1         0.3397 66.03    0.3275 67.25    0.3317 66.83    0.3161 68.39
0.2         0.2654 73.46    0.2440 75.60    0.2502 74.98    0.2381 76.19
0.3         0.2081 79.19    0.1895 81.05    0.1918 80.82    0.1793 82.07
0.4         0.1783 82.17    0.1487 85.13    0.1574 84.26    0.1384 86.16
0.5         0.1473 85.27    0.1296 87.04    0.1307 86.63    0.1207 87.93
Synch-HILL
0.01        0.4893 51.07    0.4843 51.57    0.4814 51.83    0.4687 53.13
0.05        0.3991 60.09    0.4030 59.70    0.3879 61.21    0.3720 62.80
0.1         0.3311 66.89    0.3189 68.11    0.3258 67.42    0.3086 69.14
0.2         0.2595 74.05    0.2394 76.06    0.2438 75.62    0.2269 77.31
0.3         0.1997 80.03    0.1753 82.47    0.1829 81.71    0.1607 83.93
0.4         0.1684 83.16    0.1478 85.22    0.1540 84.60    0.1311 86.89
0.5         0.1475 85.25    0.1258 87.42    0.1323 86.77    0.1193 88.07
Figure 7. Average testing error P_E as a function of the embedding payload for (a) S-UNIWARD, (b) WOW and (c) Synch-HILL, comparing the steganalysis methods (Color Rich Model, CFA-aware features steganalysis, method 1 [8] and proposed method 2).
Figure 8. ROC curves (true positive rate versus false positive rate) using our proposed method 2 feature set, for (a) S-UNIWARD, (b) WOW and (c) Synch-HILL at payloads of 0.2 bpc (top) and 0.4 bpc (bottom), comparing the detectability of messages embedded in the green channel only with messages spread over all RGB channels.
local deformations between channels. We have proposed
to add to the Color Rich Model a new set of features
based on local Euclidean and mirror transformations. The
Euclidean transformation, proposed by Abdulrahman et al.
[8], is estimated by a first set of features derived from
correlations between the gradients of the red, green and
blue channels. Since these features give only the cosine
of the angles between gradients, they do not reveal the
direction of the rotation between two channel gradients.
We have then shown that, by taking mirror transformations
into account, the missing information on the direction of
the local rotation can be recovered. Accordingly, we add
a new set of features based on the sine of the local
rotation angles. These two sets of features are then
incorporated into the Rich Model using co-occurrence
matrices, yielding 6000 features (3000 from each set [8]).
The total feature set is formed from the Color Rich Model
plus the two new sets presented in this work, giving a
vector of 24157 features. We used a quantization step with
a set of values that differs from the Color Rich Model's.
All feature vectors are fed to the Ensemble Classifier,
which is used to detect the presence of hidden messages.
Finally, multiple steganalysis comparisons have been
carried out between the proposed method, the initial Color
Rich Model [9] and the CFA-aware features steganalysis
method [22], using three steganography methods
(S-UNIWARD, WOW and Synch-HILL) with seven different
payloads. All the experiments show that our new method
outperforms the Color Rich Model and the CFA-aware
features steganalysis.
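The Euclidean (cosine) and mirror (sine) measures between the gradients of two channels can be sketched as follows; `np.gradient` is only a stand-in for the paper's actual gradient filters, which is an assumption for illustration:

```python
import numpy as np

def gradient_angle_measures(a, b, eps=1e-8):
    """Cosine of the angle between the gradients of two color planes
    (dot product, the Euclidean measure) and its sine (2-D cross
    product, the mirror measure, which also carries the sign of the
    local rotation). `eps` avoids division by zero in flat regions."""
    ay, ax = np.gradient(a.astype(float))   # d/drow, d/dcolumn
    by, bx = np.gradient(b.astype(float))
    na = np.hypot(ax, ay) + eps             # gradient magnitudes
    nb = np.hypot(bx, by) + eps
    cos = (ax * bx + ay * by) / (na * nb)   # unsigned angle information
    sin = (ax * by - ay * bx) / (na * nb)   # signed rotation direction
    return cos, sin
```

Two planes with perpendicular gradients give cos close to 0 and sin close to +1 or -1 depending on the rotation direction, which is exactly the extra information the mirror features capture.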
Our future work will focus on developing a new
steganalysis method for digital color images using
steerable filters.
6.1. Acknowledgements
The authors wish to thank the Iraqi Ministry of Higher
Education and Scientific Research for funding and
supporting this work.
REFERENCES
1. M. Rhodes-Ousley, 2013. Information Security: The
Complete Reference, Second Edition, McGraw-Hill, ISBN:
978-0-07-178436-8.
2. D. D. Bloisi and L. Iocchi, 2007. Image Based
Steganography and Cryptography, International Conference
on Computer Vision Theory and Applications (VISAPP) (1),
pp. 127–134, Barcelona,
3. C. Hosmer, and C. Hyde, 2003. Discovering Covert
Digital Evidence, Third Digital Forensic Research
Workshop (DFRWS), pp. 1–5, Cleveland, Ohio, USA,
August 6-8.
4. K. Shrikants and S. L. Nalbalwar, 2010. Review:
Steganography-Bit Plane Complexity Segmentation
(BPCS) Technique, International Journal of Engineer-
ing Science and Technology, Vol. 2, No. 9, pp. 4860–
4868.
5. J. Fridrich, 2005. Feature-based Steganalysis for
JPEG Images and its Implications for Future Design of
Steganographic Schemes, 6th International Workshop
on Information Hiding, LNCS, Vol. 3200, pp. 67–81,
Springer-Verlag, Berlin Heidelberg.
6. J. Fridrich, 2009. Steganography in Digital Media:
Principles, Algorithms, and Applications, Cambridge
University Press, Cambridge, England.
7. J. Fridrich, and M. Long, 2000. Steganalysis of
LSB Encoding in Color Images, IEEE International
Conference on Multimedia and Expo (ICME), Vol. 3,
pp. 1279–1282, New York, NY, USA.
8. H. Abdulrahman, M. Chaumont, P. Montesinos
and B. Magnier, 2015. Color Images Steganalysis
Using Correlation Between RGB Channels, 10th
International Conference on Availability, Reliability
and Security, (IWCC), pp. 448–454, Toulouse, France,
August 24-28.
9. M. Goljan, J. Fridrich and R. Cogranne, 2015. Rich
Model for Steganalysis of Color Images, In IEEE
National Conference on Parallel Computing Tech-
nologies (PARCOMPTECH), pp. 185–190, Mathikere,
Bengaluru, India, February 19–20.
10. J. Fridrich, M. Goljan and R. Du, 2001. Reliable
Detection of LSB Steganography in Color and
Grayscale Images, Proceedings of ACM Workshop on
Multimedia and Security, pp. 27–30, Ottawa, Canada.
11. A. Westfeld and A. Pfitzmann, 2000. Attacks on
steganographic systems, Information Hiding, Springer,
Vol. 1768, pp. 61–76.
12. A. D. Ker, 2005. Resampling and the Detection of LSB
Matching in color bitmaps, In International Society for
Optics and Photonics, Electronic Imaging pp. 1–15.
13. J. J. Harmsen and W. A. Pearlman, 2003. Steganalysis
of Additive-Noise Modelable Information Hiding, In
Electronic Imaging International Society for Optics
and Photonics, In: Proc. of SPIE, pp. 131–142.
14. P. Thiyagarajan, G. Aghila, and V. P. Venkatesan,
2011. Steganalysis using Color Model Conversion,
International Journal of Signal and Image Processing
(SIPIJ), Vol. 2, No. 4, pp. 201–211.
15. S. Lyu and H. Farid, 2004. Steganalysis using
Color Wavelet Statistics and One-Class Support
Vector Machines, Proceedings (SPIE), Electronic
Imaging, Security, Steganography, and Watermarking
of Multimedia Contents VI, San Jose, CA, Vol. 5306,
pp. 35–45.
16. M. Kirchner and R. Böhme, 2014. Steganalysis
in Technicolor: Boosting WS Detection of Stego
Images from CFA-Interpolated Covers, In IEEE
International Conference on Acoustics, Speech and
Signal Processing (IEEE ICASSP), pp. 3982–3986,
Florence, Italy, May 4–9.
17. J. Fridrich and M. Goljan, 2004. On Estimation
of Secret Message Length in LSB Steganography in
Spatial Domain, In International Society for Optics
and Photonics in Electronic Imaging, in Proc. EI SPIE,
Vol. 5306, pp. 23–34, San Jose, CA.
18. J. Fridrich and J. Kodovsky, 2012. Rich Models for
Steganalysis of Digital Images, In IEEE Transactions
on Information Forensics and Security, Vol. 7, No. 3,
pp. 868–882.
19. H. J. Olguin-Garcia, O. U. Juarez-Sandoval, M.
Nakano-Miyatake, H. Perez-Meana, 2015. Color
Image Steganalysis Method for LSB Matching, Pro-
ceedings of the International Conference on Security
and Management (SAM), the Steering Committee of
The World Congress in Computer Science, Computer
Engineering and Applied Computing (WorldComp),
pp. 309, Las Vegas, Nevada, USA, July 27-30.
20. B. E. Bayer, 1975. Color Imaging Array, Google
Patents, US Patent 3,971,065, filed March 5, 1975, and
issued July 20, 1976.
21. J. Wang, C. Zhang, and P. Hao, 2011. New
Color Filter Arrays of high Light Sensitivity and
high Demosaicking Performance, In 18th IEEE
International Conference on Image Processing (ICIP),
pp. 3153–3156, Brussels, Belgium, September 11-14.
22. M. Goljan and J. Fridrich, 2015. CFA-Aware Features
for Steganalysis of Color Images, Proc. SPIE,
Electronic Imaging, Media Watermarking, Security,
and Forensics, Vol. 9409, San Francisco, CA, February
8–12.
23. D. Hilbert, 1993. Theory of Algebraic Invariants,
Cambridge University Press, Cambridge.
24. V. Gouet, P. Montesinos, and D. Pelé, 1998. A Fast
Matching Method for Color Uncalibrated Images
using Differential Invariants, Proceedings of the
British Machine Vision Conference (BMVC), pp. 1–
10, Southampton, September 3–7.
25. T. G. Dietterich, 2000. Ensemble Methods in Machine
Learning, In First International Workshop on Multiple
Classifier Systems (MCS), pp. 1–15, Sardinia, Italy,
June 21–23.
26. J. Kodovsky, J. Fridrich, and V. Holub, April 2012.
Ensemble Classifiers for Steganalysis of Digital
Media, In IEEE Transactions on Information Forensics
and Security, Vol. 7, No. 2, pp. 432–444.
27. L. Yuan and J. Sun, 2011. High Quality Image
Reconstruction from Raw and JPEG Image Pair, In
13th IEEE International Conference on Computer
Vision (ICCV), pp. 2158–2165, Barcelona, Spain,
November 6–13.
28. T. Gloe and R. Böhme, 2010. The Dresden Image
Database for Benchmarking Digital Image Forensics,
Journal of Digital Forensic Practice, Vol. 3, pp. 150–
159.
29. V. Holub, J. Fridrich, and T. Denemark, 2014.
Universal Distortion Function for Steganography
in an Arbitrary Domain, EURASIP Journal on
Information Security, Special Issue on Revised
Selected Papers of the 1st ACM Information Hiding
(IH) and the ACM Multimedia and Security (MMSec)
Workshop, Vol. 1, pp. 1–13.
30. V. Holub and J. Fridrich, 2012. Designing Stegano-
graphic Distortion using Directional Filters, In IEEE
International Workshop on Information Forensics
and Security (WIFS), pp. 234–239, Tenerife, Spain,
December 2–5.
31. T. Denemark and J. Fridrich, 2015. Improving
Steganographic Security by Synchronizing the Selec-
tion Channel, Proceedings of the 3rd ACM Work-
shop on Information Hiding and Multimedia Security,
(ACM), pp. 5–14, Portland, Oregon, June 17–19.