Multilingual Fact-Checking at Scale: Fine-Tuned Compact Models vs LLMs
Researchers have developed M4FC, a new dataset for multimodal fact-checking that includes over 4,900 images and 6,900 claims in up to ten languages, verified by professionals. This dataset supports six distinct fact-checking tasks, aiming to overcome limitations of existing resources. Separately, a study at Factiverse compared fine-tuned compact models against large language models like GPT-5.2 and Claude Opus 4.6 for multilingual fact-checking, finding that specialized models offer efficiency and competitive performance for production systems. AI
IMPACT Advances in multilingual fact-checking datasets and efficient model architectures could improve the scalability and accuracy of combating misinformation across different languages.