New framework boosts VLM chart understanding with counterfactual data

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-11 17:02

Researchers have developed ChartCF, a new framework to improve the data efficiency of vision-language models (VLMs) used for chart understanding. This method leverages counterfactual data synthesis, where small code-controlled changes in charts can lead to significant semantic shifts. ChartCF also incorporates a chart similarity-based data selection strategy and multimodal preference optimization to enhance training efficiency and performance on chart-related tasks. AI

影响 Enhances data efficiency for chart understanding models, potentially reducing training costs and accelerating deployment.

排序理由 The cluster contains an academic paper detailing a new method for improving AI model performance. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Wenya Wang · 2026-05-11 17:02

Learning More from Less: Exploiting Counterfactuals for Data-Efficient Chart Understanding

Vision-Language Models (VLMs) have demonstrated remarkable progress in chart understanding, largely driven by supervised fine-tuning (SFT) on increasingly large synthetic datasets. However, scaling SFT data alone is inefficient and overlooks a key property of charts: charts are p…

报道来源 [1]

Learning More from Less: Exploiting Counterfactuals for Data-Efficient Chart Understanding

相关实体

相关话题