IGenBench: Benchmarking the Reliability of Text-to-Infographic Generation
Researchers have introduced IGenBench, a new benchmark designed to evaluate the reliability of text-to-infographic generation models. The benchmark consists of 600 test cases across 30 infographic types, with an automated evaluation framework that uses multimodal large language models to assess accuracy. Initial testing on ten state-of-the-art text-to-image models revealed significant challenges, particularly with data-related aspects, highlighting a gap between perceived aesthetic quality and actual functional correctness. AI
IMPACT Highlights critical limitations in current text-to-infographic models, particularly concerning data accuracy, guiding future development.