E-VQA
PulseAugur coverage of E-VQA — every cluster mentioning E-VQA across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
New ProMSA agent enhances knowledge-based visual question answering
Researchers have introduced ProMSA, a novel agent designed for Knowledge-Based Visual Question Answering (KB-VQA). Unlike previous methods that rely on fixed retrieval pipelines, ProMSA progressively selects between ima…
-
New MAD-RAG method tackles Attention Distraction in LVLMs
Researchers have identified a new failure mode in retrieval-augmented large vision-language models (LVLMs) called Attention Distraction (AD). This occurs when highly relevant retrieved text globally suppresses visual at…
-
New VQA methods enhance explainability and knowledge integration for multimodal LLMs
Researchers have developed CoExVQA, a new framework for Document Visual Question Answering (DocVQA) that enhances explainability by breaking down the reasoning process. This method first identifies relevant evidence, th…