Computer Science > Cryptography and Security
[Submitted on 2 Apr 2019 (v1), revised 19 Sep 2019 (this version, v2), latest version 2 Apr 2020 (v4)]
Title:An Efficient Intrusion Detection System Based on Feature Selection and Ensemble Classifier
View PDFAbstract:Intrusion detection systems (IDSs) are extensively used in a network topology aiming to safeguard the integrity and availability of sensitive assets in the protected systems. Although many supervised and unsupervised learning approaches from the field of machine learning have been used to increase the efficacy of IDSs, it is still a problem to deal with lots of redundant and irrelevant data in high-dimensional datasets. In this paper, we introduce a novel machine learning based IDS to increase the accuracy and effiency of classification. Due to the high dimensionality of network traffic, we propose a heuristic algorithm called Correlation-based-Feature-Selection-Bat-Algorithm (CFS-BA) to solve this problem. For feature selection, CFS-BA supports evaluation of the correlation between features and selects the optimal subset for training and testing process. Meanwhile, we introduce an ensemble approach, which combines decisions from C4.5, Random Forest (RF) and Forest by Penalizing Attributes (Forest PA) based on the average of probabilities (AOP) rule. It helps to deal with unbalanced datasets and multi-class classification. Experimental results, using KDDCup'99, NSL-KDD and CIC-IDS2017 datasets, reveal the superiority of our CFS-BA-Ensemble method, over other related techniques under several metrics. Specifically, the proposed IDS reduces the training and testing time from 113.53 and 2.93 to 44.78 and 2.06 on the CIC-IDS2017, respectively. Moreover, compared with other feature selection methods, our proposal achieves the highest F-Measure of 0.998 and lowest False Alarm Rate (FAR) of 0.17% on KDDCup'99.
Submission history
From: Yuyang Zhou [view email][v1] Tue, 2 Apr 2019 11:45:41 UTC (314 KB)
[v2] Thu, 19 Sep 2019 07:46:19 UTC (278 KB)
[v3] Mon, 30 Mar 2020 03:17:56 UTC (402 KB)
[v4] Thu, 2 Apr 2020 10:17:25 UTC (402 KB)
References & Citations
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.