Author

Manav Bhasin

Publication Date

Spring 2025

Degree Type

Master's Project

Degree Name

Master of Science in Computer Science (MSCS)

Department

Computer Science

First Advisor

Faranak Abri

Second Advisor

Sayma Akther

Third Advisor

Narayan Balasubrmanian

Keywords

Topic Modeling, Large Languge Models, Social Engineering

Abstract

Detecting social engineering attempts is crucial for security, as these threats are becoming more frequent and increasingly exploit human vulnerabilities. This research focuses on topic modeling using conversational data from Kevin Mitnick’s ”The Art of Deception” with dialogues that illustrate various social engineering strategies. The dataset comprises manually extracted and synthetically augmented conversations to ensure natural dialogue flow. Two methodologies are presented for utterance-level and global topic extraction: prompt engineering leveraging OpenAI’s GPT-4o-mini, characterized by few-shot learning and chain-of-thought prompting, and Quantized Low Rank Adaptation (QLoRA) utilizing Mistral’s 7B instruct model for efficient fine-tuning. Through experimentation and evaluation, this study aims to determine the effectiveness of topic modeling approaches in enhancing the identification and classification of social engineering scenarios

Available for download on Wednesday, May 20, 2026

Share

COinS