Synthetic Malware Image Generation Based on Generative Models Against Zero-Day Attacks
Publication Date
8-27-2025
Document Type
Conference Proceeding
Publication Title
2025 Silicon Valley Cybersecurity Conference Svcc 2025
DOI
10.1109/SVCC65277.2025.11133622
Abstract
Malware detection is a critical task for protecting our assets from attacks. However, traditional approaches to malware detection often struggle with limited datasets, which hinder the effectiveness of machine learning models. In this paper, we propose a malware image generation system designed to craft high-quality synthetic malware image samples, addressing the challenges posed by small datasets in malware detection. The proposed system utilizes two popular generative models, WGAN-GP and Diffusion, to generate synthetic malware images. It converts malware binary files into image files using four different color spaces: monochrome, grayscale, RGB, and CMYK. These images are then evaluated based on key performance metrics such as accuracy, precision, and recall. A cosine similarity evaluation system is employed to filter high-quality samples and enhance data quality. The experimental results demonstrate that the system generates synthetic samples that improve the performance of malware detection models. The findings indicate that synthetic image-based representations are effective for malware detection tasks.
Funding Number
2244597
Funding Sponsor
National Science Foundation
Keywords
color space, Diffusion, Image-Based Malware Classification, Malware, Wasserstein General Adversarial Network with Gradient Penalty (WGAN-GP)
Department
Computer Science; Computer Engineering
Recommended Citation
Arjun Sudheer, Ayesha Ahmed, Fabio Di Troia, and Younghee Park. "Synthetic Malware Image Generation Based on Generative Models Against Zero-Day Attacks" 2025 Silicon Valley Cybersecurity Conference Svcc 2025 (2025). https://doi.org/10.1109/SVCC65277.2025.11133622