A Review of Tabular Data Synthesis Using GANs on an IDS Dataset

Authors :
Theodore Zahariadis
Stavroula Bourou
Terpsichori-Helen Velivassaki
Andreas El Saer
Artemis Voulkidis
Source :
Information, Volume 12, Issue 9, p 375 (2021)
Publication Year :
2021
Publisher :
MDPI AG, 2021.

Abstract

Recent technological innovations, along with the vast amount of data available worldwide, have led to a rise in cyberattacks against network systems. Intrusion Detection Systems (IDS) play a crucial role as a defense mechanism in networks against adversarial attackers. Machine Learning methods provide various cybersecurity tools. However, these methods require large amounts of training data, which may be hard to collect or to use for privacy reasons. One of the most notable Machine Learning tools is the Generative Adversarial Network (GAN), which has great potential for tabular data synthesis. In this work, we start by briefly presenting the most popular GAN architectures: VanillaGAN, WGAN, and WGAN-GP. Focusing on tabular data generation, the CTGAN, CopulaGAN, and TableGAN models are used to create synthetic IDS data. Specifically, the models are trained and evaluated on the NSL-KDD dataset, taking into account the limitations and requirements of this procedure. Finally, based on certain quantitative and qualitative methods, we discuss and evaluate the most prominent GANs for tabular network data synthesis.

Details

ISSN :
2078-2489
Volume :
12
Database :
OpenAIRE
Journal :
Information
Accession number :
edsair.doi.dedup.....028794803af480bff772dd8c0c52c945
Full Text :
https://doi.org/10.3390/info12090375