Publication Date
Spring 2025
Degree Type
Master's Project
Degree Name
Master of Science in Computer Science (MSCS)
Department
Computer Science
First Advisor
Fabio Di Troia
Second Advisor
Navrati Saxena
Third Advisor
Thomas Austin
Keywords
Convolutional Neural Network, PixelCNN, Malware, Deep Learning, Machine Learning
Abstract
Malware poses a serious threat to both data privacy and system security. With the wide variety of malware families and the surge in cyber-attacks, the accurate classification of malware is crucial for building effective detection and prevention systems. In recent years, deep learning (DL) methods in computer vision have shown promise in classifying malware by converting malware files into visual representations and applying DL algorithms to classify the resulting images. Among the different approaches to malware family classification, image-based methods have gained significant interest. This research focuses on leveraging DL techniques for image-based classification of malware. The success of identifying malicious files largely depends on the quality and size of the training dataset, as well as its authenticity. However, a major obstacle in utilizing DL for malware detection is the shortage of training data. To address this challenge, this project utilizes the Pixel Convolutional Neural Network (PixelCNN) architecture to generate synthetic, class-specific malware images pixel by pixel, capturing intricate details of the images. Additionally, a CNN-based DL model was developed, demonstrating that training with a mixture of at least 15% real malware data alongside the generated images resulted in notable multi-class classification accuracy.
Recommended Citation
Karumudi, Mounika Krishna Teja, "Malware Generation and Classification using PixelCNN" (2025). Master's Projects. 1556.
DOI: https://doi.org/10.31979/etd.gs68-5tdh
https://scholarworks.sjsu.edu/etd_projects/1556