Uncovering Vulnerabilities of LLM-Assisted Cyber Threat Intelligence
A new research paper explores the vulnerabilities of large language models (LLMs) when applied to cyber threat intelligence (CTI). The study identifies three specific cognitive failures in LLMs within CTI workflows: spurious correlations from metadata, contradictory knowledge from conflicting sources, and limited generalization to new threats. Researchers developed a human-in-the-loop framework to label these failures and demonstrated that targeted defenses can significantly reduce error rates, offering a path toward more resilient CTI agents. AI
IMPACT Identifies specific failure modes of LLMs in CTI, guiding development of more robust security tools.