The Power of Patterns in Detecting News Articles Written by AI
Publication Date
9-19-2025
Document Type
Conference Proceeding
Publication Title
Proceedings 2025 IEEE International Conference on Artificial Intelligence Testing, AITest 2025
DOI
10.1109/AITest66680.2025.00010
First Page
34
Last Page
41
Abstract
As AI text generation advances, distinguishing human-written from machine-generated content has become both harder and more important. This research addresses the problem in news media, where maintaining information integrity is essential. We implement a novel approach for detecting AI-generated news articles that transforms text into two types of vector representations: semantic embeddings produced by OpenAI's embedding model and syntactic patterns derived from Part-of-Speech (POS) tagging. These vectors are then analyzed with unsupervised machine learning models, including K-Means, Hierarchical Clustering, and Gaussian Mixture Models (GMM). Our results show that GMM achieves 99.6% accuracy, significantly outperforming existing resource-intensive detection methods such as Ghostbuster. The approach is also considerably less expensive. Future work will involve testing with other large language models, such as LLaMA and Gemini, and exploring advanced feature engineering techniques to further improve detection as AI text generation evolves.
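The syntactic half of the pipeline described in the abstract (POS tags turned into vectors, then clustered without labels) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the articles have already been tagged by some POS tagger, it uses a small hypothetical tag inventory (`TAGS`) and made-up toy tag sequences, and it substitutes a plain standard-library K-Means for the GMM and OpenAI embeddings reported in the paper. The accuracy it computes on the toy data says nothing about the paper's results.

```python
from collections import Counter
from itertools import permutations
import math

# Hypothetical tag inventory (assumption); real taggers use larger tag sets.
TAGS = ["NOUN", "VERB", "ADJ", "ADV", "PRON", "DET"]


def pos_vector(tags):
    """Turn a sequence of POS tags into a normalized tag-frequency vector."""
    counts = Counter(tags)
    total = len(tags) or 1
    return [counts[t] / total for t in TAGS]


def kmeans(points, k=2, iters=50):
    """Plain K-Means with deterministic farthest-point initialization."""
    centers = [points[0]]
    while len(centers) < k:
        # Next center: the point farthest from all centers chosen so far.
        centers.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centers))
        )
    labels = None
    for _ in range(iters):
        # Assignment step: nearest center for every point.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centers[j])) for p in points
        ]
        if new_labels == labels:
            break  # converged
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def clustering_accuracy(pred, truth, k=2):
    """Accuracy under the best mapping of cluster ids to true labels."""
    best = 0.0
    for perm in permutations(range(k)):
        hits = sum(1 for p, t in zip(pred, truth) if perm[p] == t)
        best = max(best, hits / len(truth))
    return best


# Made-up tag sequences for illustration only (0 = human, 1 = AI).
docs = [
    ["PRON", "VERB", "ADV", "PRON", "VERB", "NOUN"],
    ["PRON", "VERB", "ADV", "VERB", "PRON", "ADV"],
    ["PRON", "VERB", "NOUN", "ADV", "VERB", "PRON"],
    ["DET", "ADJ", "NOUN", "VERB", "DET", "NOUN"],
    ["DET", "ADJ", "ADJ", "NOUN", "VERB", "NOUN"],
    ["DET", "NOUN", "VERB", "DET", "ADJ", "NOUN"],
]
truth = [0, 0, 0, 1, 1, 1]

vectors = [pos_vector(d) for d in docs]
labels = kmeans(vectors, k=2)
acc = clustering_accuracy(labels, truth)
print(labels, acc)
```

Because clustering is unsupervised, cluster ids carry no meaning on their own, which is why `clustering_accuracy` scores the prediction under the best assignment of cluster ids to the human and AI labels. Swapping `kmeans` for a Gaussian mixture would keep the same vectorize-then-cluster structure.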
Keywords
AI-Text, ChatGPT, Classification, Embedding, News Articles
Department
Computer Science
Recommended Citation
Lilou Sicard-Noel, Eric Wu, and Mike Wu. "The Power of Patterns in Detecting News Articles Written by AI." Proceedings 2025 IEEE International Conference on Artificial Intelligence Testing, AITest 2025 (2025): 34-41. https://doi.org/10.1109/AITest66680.2025.00010