Publication Date

12-1-2025

Document Type

Article

Publication Title

Journal of Computer Virology and Hacking Techniques

Volume

Issue

DOI

10.1007/s11416-025-00568-y

Abstract

Concept drift refers to gradual or sudden changes in the properties of data that affect the accuracy of machine learning models. In this paper, we address the problem of concept drift detection in the malware domain. Specifically, we propose and analyze a lightweight clustering-based approach to detecting concept drift. Using a subset of the KronoDroid dataset, malware samples are partitioned into temporal batches and analyzed using MiniBatch K-Means clustering. The silhouette coefficient is used as a metric to identify points in time where concept drift has likely occurred. To verify our drift detection results, we train learning models under three realistic scenarios, which we refer to as static training, periodic retraining, and drift-aware retraining. In each scenario, we consider four supervised classifiers, namely, multilayer perceptron (MLP), support vector machine (SVM), random forest, and XGBoost. Experimental results demonstrate that drift-aware retraining guided by silhouette coefficient thresholding achieves classification accuracy far superior to static models and, on average, within 0.5% of periodic retraining, while also being far more efficient than periodic retraining. These results provide strong evidence that our clustering-based approach is effective at detecting concept drift, while also illustrating a highly practical and efficient fully automated approach to improved malware classification via concept drift detection.

Creative Commons License

This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.

Department

Computer Science

Recommended Citation

Aniket Mishra and Mark Stamp. "Cluster analysis and concept drift detection in malware" Journal of Computer Virology and Hacking Techniques (2025). https://doi.org/10.1007/s11416-025-00568-y

Download

Find in your library

COinS

Faculty Research, Scholarly, and Creative Activity

Cluster analysis and concept drift detection in malware

Publication Date

Document Type

Publication Title

Volume

Issue

DOI

Abstract

Creative Commons License

Department

Recommended Citation

Search

Browse All

Links

Faculty Research, Scholarly, and Creative Activity

Cluster analysis and concept drift detection in malware

Authors

Publication Date

Document Type

Publication Title

Volume

Issue

DOI

Abstract

Creative Commons License

Department

Recommended Citation

Share

Search

Browse All

Links