TABVERSE: Benchmarking Cross-Format Table Understanding in LLMs and VLMs
Researchers have introduced TABVERSE, a new benchmark designed to evaluate how well Large Language Models (LLMs) and Vision-Language Models (VLMs) understand tables across different formats. The benchmark standardizes table content while varying its representation, such as HTML, Markdown, LaTeX, and rendered images. Initial findings indicate that model performance is significantly influenced by the table's format, with structured text generally outperforming images, though specific tasks and formats present unique challenges. AI
IMPACT Highlights the impact of data representation on LLM/VLM performance, suggesting a need for robust cross-format handling in future model development.