MASCOT-Android: A Curated Dataset and Automated Collection Pipeline for Android Malware Source Code Specimens
Researchers have developed MASCOT-Android, a new dataset and automated pipeline for collecting Android malware source code from GitHub. The system utilizes a LinearSVC classifier trained on TF-IDF features from README documents to identify malware repositories with high accuracy. This approach significantly reduces the cost and effort associated with manual review, enabling scalable discovery of malware source code. AI