The Study of Feature Engineering in Machine Learning and Deep Learning for Network Intrusion Detection Systems

Publication Date

1-1-2024

Document Type

Conference Proceeding

Publication Title

2024 Silicon Valley Cybersecurity Conference, SVCC 2024

DOI

10.1109/SVCC61185.2024.10637359

Abstract

With the rise of sophisticated cyberattacks, the efficiency of intrusion detection systems becomes paramount. Machine learning (ML) and deep learning (DL) models used for intrusion detection often encounter datasets with irrelevant or redundant features, leading to low performance. To address this challenge, feature engineering techniques are important in extracting the most informative features, enabling faster and more accurate detection of malicious patterns. This paper investigates a comparative analysis of four feature engineering methods using a historical archival dataset: entropy, mutual information, chi- squared statistics, and ANOVA. By evaluating and comparing the effectiveness of these methods under different conditions of ML/DL models, this study aims to provide insights into their respective strengths and weaknesses, guiding the selection of the most suitable approach to improve the efficiency of network intrusion detection systems.

Funding Number

2244597

Funding Sponsor

National Science Foundation

Keywords

Deep Learning Models, Feature Engineering, Machine Learning Models, Network Intrusion Detection System, NSL-KDD Datasets

Department

Computer Engineering

Share

COinS