Intrusion Detection System model.

Source publication

Machine Learning Approach for Detection of nonTor Traffic

Conference Paper

Full-text available

Aug 2017

Intrusion detection has attracted a considerable interest from researchers and industries. After many years of research the community still faces the problem of building reliable and efficient intrusion detection systems (IDS) capable of handling large quantities of data with changing patterns in real time situations. The Tor network is popular in...

Context 1

... detection system is a software application or a device placed at strategic places on a network to monitor and detect anomalies in network traffic [1][2] as shown in Fig. 1. The main features of IDS are to raise an alarm when an anomaly is detected. A complementary approach is to take corrective measures when anomalies are detected, such an approach is referred to as an intrusion Prevention System (IPS) [3]. Based on the interactivity property of IDS, it can be designed to work either on-line or off- ...

View in full-text

Machine Learning Approach for Detectionof nonTor Traffic

Article

Full-text available

Apr 2017

Intrusion detection has attracted a considerable interest from researchers and industry. After many years of research the community still faces the problem of building reliable and efficient intrusion detection systems (IDS) capable of handling large quantities of data with changing patterns in real time situations. The Tor network is popular in pr...

Darknet Traffic Analysis: A Systematic Literature Review

Article

Full-text available

Jan 2024

The primary objective of an anonymity tool is to protect the anonymity of its users through the implementation of strong encryption and obfuscation techniques. As a result, it becomes very difficult to monitor and identify users’ activities on these networks. Moreover, such systems have strong defensive mechanisms to protect users against potential risks, including the extraction of traffic characteristics and website fingerprinting. However, the strong anonymity feature also functions as a refuge for those involved in illicit activities who aim to avoid being traced on the network. As a result, a substantial body of research has been undertaken to examine and classify encrypted traffic using machine-learning techniques. This paper presents a comprehensive examination of the existing approaches utilized for the categorization of anonymous traffic as well as encrypted network traffic inside the darknet. Also, this paper presents a comprehensive analysis of methods of darknet traffic using ML (machine learning) techniques to monitor and identify the traffic attacks inside the darknet.

Classifying Tor Traffic Encrypted Payload Using Machine Learning

Article

Full-text available

Jan 2024

Tor, a network offering Internet anonymity, presented both positive and potentially malicious applications, leading to the need for efficient Tor traffic monitoring. While most current traffic classification methods rely on flow-based features, these can be unreliable due to factors like asymmetric routing, and the use of multiple packets for feature computation can lead to processing delays. Recognising the multi-layered encryption of Tor compared to nonTor encrypted payloads, our study explored distinct patterns in their encrypted data. We introduced a novel method using Deep Packet Inspection and machine learning to differentiate between Tor and nonTor traffic based solely on encrypted payload. In our first strand of research, we investigated hex character analysis of the Tor and nonTor encrypted payloads through statistical testing across 8 groups of application types. Remarkably, our investigation revealed a significant differentiation rate of 94.53% between Tor and nonTor traffic. In the second strand of our research, we aimed to distinguish Tor and nonTor traffic using machine learning, based on encrypted payload features. This proposed feature-based approach proved effective, as evidenced by our classification performance, which attained an average accuracy rate of 95.65% across these 8 groups of applications. Thereby, this study contributes to the efficient classification of Tor and nonTor traffic through features derived solely from a single encrypted payload packet, independent of its position in the traffic flow.

Hybrid intelligent feature selector framework for darknet traffic classification

Article

Full-text available

Oct 2023
MULTIMED TOOLS APPL

With the rapid growth in the internet traffic, analysis and exploration of traffic classification has become more challenging task especially for dark net or dark web. Darknet, within the confines of deep web, has been witnessing illegitimate doings such as drug trafficking, terrorism, betting etc. Hence classification of darknet traffic is an important task. This paper presents intelligent framework for the darknet traffic categorization with proposed hybrid feature selector. The darknet traffic consists of data packet related information as input predictors for the model. Twenty-one machine learning models with and without feature selectors are presented to categorize the darknet traffic. We propose a Hybrid LASSO-Random Forest (HLRF) feature selector to reduce feature dimensionality. Classification of darknet traffic is evaluated with well-known KNN, Extra Tree and XGBoost classifiers. The performance of proposed models was assessed in terms of Accuracy, Precision, Recall, Harmonic Mean Value of True Positive Value (TPV) and Positive Predicted Value (PPV), Matthews Corelation Coefficients (MCC) and Jaccard Score. The experimental results approve that XGBoost with proposed HLRF feature selector outperforms in categorization of darknet traffic. The results revel that proposed XGBoost-HLRF model obtain an accuracy and recall value of 98.10% with precision of 98.12%. Comparison of XGBoost-HLRF model with other proposed state of art models are presented for performance assessment of our model.

AE-DTI: An Efficient Darknet Traffic Identification Method Based on Autoencoder Improvement

Article

Full-text available

Aug 2023

With the continuous expansion of the darknet and the increase in various criminal activities in the darknet, darknet traffic identification has become increasingly essential. However, existing darknet traffic identification methods rely on all traffic characteristics, which require a long computing time and a large amount of system resources, resulting in low identification efficiency. To this end, this paper proposes an autoencoder-based darknet traffic identification method (AE-DTI). First, AE-DTI maps the feature values to pixels of a two-dimensional grayscale image after deduplication and denoising of the darknet traffic dataset. Then, AE-DTI designs a new feature selection algorithm (AE-FS) to downscale the grayscale graph, and AE-FS trains a feature scoring network, which globally scores all the features based on the reconstruction error to select the features with scores greater than or equal to a set threshold value. Finally, AE-DTI uses a one-dimensional convolutional neural network with a dropout layer to identify darknet traffic on the basis of alleviating overfitting. Experimental results on the ISCXTor2016 dataset show that, compared with other dimensionality reduction methods (PCA, LLE, ISOMAP, and autoencoder), the classification model trained with the data obtained from AE-FS has a significant improvement in classification accuracy and classification efficiency. Moreover, AE-DTI also shows significant improvement in recognition accuracy compared with other models. Experimental results on the CSE-CIC-IDS2018 dataset and CIC-Darknet2020 dataset show that AE-DTI has strong generalization.

Discerning the traffic in anonymous communication networks using machine learning: concepts, techniques and future trends

Article

Mar 2023
Int J Inform Decis Sci

With the growing need for anonymity and privacy on the Internet, Anonymous Communication Networks (ACNs) such as Tor, I2P, JonDonym, and Freenet have risen to fame. Such anonymous networks aim to provide freedom of expression and protection against tracking to its users. Simultaneously, there is also a class of users involved in the illegal usage of these ACNs. An emerging research topic in the field of ACNs is network traffic classification, as it can improve the network security against illegal users as well as improve the Quality of Service for its legal users. In this study, we review the research works available in the literature relevant to traffic classification in ACNs based on Machine Learning and also present to the researchers the general concepts and techniques in this area. A discussion on future trends in this area is also provided to bring out the future enhancements required in ML-based network traffic classification in ACNs.

Adversarial Malicious Encrypted Traffic Detection Based on Refined Session Analysis

Article

Full-text available

Nov 2022

The detection of malicious encrypted traffic is an important part of modern network security research. The producers of the current malware do not pay attention to the fact that malicious encrypted traffic can also be detected; they do not construct further adversarial malicious encrypted traffic to deceive existing malicious encrypted traffic detection methods. However, with the increasing confrontation between attack and defense, adversarial malicious encrypted traffic samples will appear gradually, which will make the existing malicious encrypted traffic detection methods obsolete. In this paper, an adversarial malicious encrypted traffic detection method based on refined session analysis (ADRSA) is proposed. The key ideas of this method are: 1) interpretability analysis is used to extract malicious traffic features that are not easily affected by encryption, 2) restoration technology is used to further improve traffic separability, and 3) a deep neural network is used to identify adversarial malicious encrypted traffic. In experimental tests, the ADRSA method could accurately detect malicious encrypted traffic, particularly adversarial malicious encrypted traffic, and the detection rate is more than 95%. However, the detection rate of other malicious encrypted traffic detection methods is almost zero when facing adversarial malicious encrypted traffic. The detection performance of ADRSA exceeds that of the most popular detection methods.

Machine Learning-Based Load Forecasting for Nanogrid Peak Load Cost Reduction

Article

Full-text available

Sep 2022

Increased focus on sustainability and energy decentralization has positively impacted the adoption of nanogrids. With the tremendous growth, load forecasting has become crucial for their daily operation. Since the loads of nanogrids have large variations with sudden usage of large household electrical appliances, existing forecasting models, majorly focused on lower volatile loads, may not work well. Moreover, abrupt operation of electrical appliances in a nanogrid, even for shorter durations, especially in “Peak Hours,” raises the energy cost substantially. In this paper, an ANN model with dynamic feature selection is developed to predict the hour-ahead load of nanogrids based on meteorological data and a load lag of 1 h (t-1). In addition, by thresholding the predicted load against the average load of previous hours, peak loads, and their time indices are accurately identified. Numerical testing results show that the developed model can predict loads of nanogrids with the Mean Square Error (MSE) of 0.03 KW, the Mean Absolute Percentage Error (MAPE) of 9%, and the coefficient of variation (CV) of 11.9% and results in an average of 20% daily energy cost savings by shifting peak load to off-peak hours. Keywords: nanogrids; peak load; load forecasting; artificial neural network (ANN); machine learning; microgrids

Fusion Dilated CNN for Encrypted Web Traffic Classification

Article

Full-text available

Jul 2022

A Growing number of conventional Convolutional neural network (CNN) models have been employed for encrypted web traffic characterization. However, the application of CNN models is confronted with two significant challenges; a) they possess short reflective fields that don't gather much-encrypted traffic information for effective and accurate predictions. b) these models are not adaptive to the diverse nature of traffic flow because of their single-head architecture. This paper alleviates these problems using the fusion of dilated Convolutional neural networks dubbed FDCNN. FDCNN architecture supports exponentially large receptive fields and captures local dependencies in encrypted traffic data. The experimental results on public datasets, ISCX VPNnon-VPN Traffic datasets, indicate that FDCNN architecture is practical and achieves higher accuracy.

Assessment of Network Intrusion Detection System Based on Shallow and Deep Learning Approaches

Chapter

May 2022

This extensive review aims to classify the Intrusion Detection System (IDS) and various machine learning and deep learning (ML/DL) approaches used for IDS. The survey also addresses security, which is a concern with the Internet of Things. Several types of intrusion detection systems (IDSs), including shallow and deep learning methods and various learning algorithms to aid intrusion detection, are also categorized. This research expands on Network Intrusion Detection Systems and investigates techniques for improving their performance. It provides a more comprehensive understanding of deep and shallow learning methodologies with their benefits and drawbacks. The study component examines IDS classification, feature extraction techniques, machine learning, deep learning, and examples of how these may be applied. The essence of this review will establish a viable approach to assist professionals in modeling trustworthy and powerful IDS based on real-time requirements. Because the methods of intrusions and cyberattacks in networks are constantly evolving, it attracted the interest of many scholars and industrial professionals. However, cyber specialists struggle to develop an accurate and effective Intrusion Detection System (IDS). In addition, an increasing number of devices has resulted in more complicated network topology, raising security risks. As a result, a lengthy and exhaustive review is indispensable while developing a secure communication system.

A CNN based Encrypted Network Traffic Classifier

Conference Paper

Full-text available

Feb 2022

The internet is responsible for global connectivity and ensuring its safety is a paramount task for governments and organisations. Cybersecurity concerns led to the encryption of over 87% of internet traffic. Encryption ensures security by improving privacy between sender and receiver but creates a problem of inaccurate traffic classification. Previous papers have used Artificial Intelligence to address this problem, however issues such as model simplicity, complexity, imbalanced dataset etc, are problems yet to be addressed. Overfitting, underfitting and ultimately poor classification are outcomes of poorly designed models. This paper applies deep learning to the problem of encrypted traffic classification. A Convolutional Neural Network (CNN) is used to address this problem. An eleven layered architecture is designed and trained with a range of images generated from the metadata of encrypted traffic. At its core, the design is made less complex for understandability and deals with overfitting. The proposed model is assessed with the standard metrics of accuracy, precision, recall and 1 score then compared to a baseline model. The model is trained and tested for seven classification problems, using three encryption types (https, vpn, tor). For all classification tasks, the proposed model achieved accuracies ranging from 91%-99%, which is an indication of optimum generalization strength. Our model outperformed the baseline model which had accuracies ranging from 67.6%-99%, an indication of poor generalization strength.

Intrusion Detection System model.

Context in source publication

Similar publications

Citations