PulseAugur

CiteVQA

PulseAugur coverage of CiteVQA — every cluster mentioning CiteVQA across labs, papers, and developer communities, ranked by signal.

Total · 30 days: 3 (90 days: 3)
Releases · 30 days: 0 (90 days: 0)
Papers · 30 days: 3 (90 days: 3)
Timeline
  1. 2026-05-13 research_milestone Introduction of the CiteVQA benchmark for evaluating evidence attribution in multimodal large language models.
Sentiment · 30 days

2 days with sentiment data

Recent · Page 1 of 1 · 3 items
  1. TOOL · CL_49038 ·

    GPT-4 and other AI models fail to cite sources accurately, study finds

    A new study based on the CiteVQA benchmark indicates that leading AI models, including GPT-4, frequently provide correct answers but struggle to cite their sources reliably. This inability to attribute information accurately raises conce…

  2. TOOL · CL_49036 ·

    AI models hallucinate citations, new benchmark reveals

    Leading AI models such as GPT and Gemini frequently provide correct answers while citing non-existent or irrelevant evidence. This phenomenon, termed "attribution hallucination" by researchers at Peking University, pose…

  3. TOOL · CL_30596 ·

    New benchmark CiteVQA exposes "Attribution Hallucination" in LLMs

    Researchers have introduced CiteVQA, a new benchmark designed to evaluate multimodal large language models (MLLMs) on their ability to accurately attribute answers to specific source regions within documents. Unlike pre…