PulseAugur
EN
LIVE 21:34:00

New benchmark tests VLM understanding of Japanese charts and tables

Researchers have developed HakushoBench, a new benchmark for evaluating vision-language models (VLMs) on their ability to understand Japanese charts and tables. The dataset is derived from 33 Japanese governmental white papers, containing over 2,000 images and manually annotated question-answer pairs. Initial experiments show a significant performance gap between open-weight and proprietary models, indicating substantial room for improvement in VLM capabilities for complex, non-English document analysis. AI

IMPACT Establishes a new evaluation standard for VLM performance on non-English visual data, potentially driving improvements in multilingual document understanding.

RANK_REASON The cluster describes a new academic benchmark dataset for evaluating AI models.

Read on Hugging Face Daily Papers →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    HakushoBench: A Japanese Chart and Table VQA Benchmark from Governmental White Papers

    Researchers created HakushoBench, a Japanese chart and table visual question answering benchmark derived from governmental documents, to evaluate vision-language models' ability to understand complex visual data beyond English-language datasets.

  2. arXiv cs.CV TIER_1 English(EN) · Issa Sugiura, Shuhei Kurita, Yusuke Oda, Naoaki Okazaki ·

    HakushoBench: A Japanese Chart and Table VQA Benchmark from Governmental White Papers

    arXiv:2606.01132v1 Announce Type: new Abstract: Understanding chart and table images is essential for applying vision-language models (VLMs) to real-world document understanding. While English benchmarks have advanced rapidly, non-English counterparts remain scarce, leaving it un…