PulseAugur

Multi-modal Large Language Models

PulseAugur coverage of Multi-modal Large Language Models — every cluster mentioning Multi-modal Large Language Models across labs, papers, and developer communities, ranked by signal.

Total · 30d: 8 (8 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 8 (8 over 90d)
Sentiment · 30d: 4 days with sentiment data

RECENT · PAGE 1/1 · 8 TOTAL
  1. RESEARCH · CL_107688

    New 'Ground Then Rank' method boosts knowledge-based visual question answering

    Researchers have developed a new framework called "Ground Then Rank" (GTR) to improve Knowledge-Based Visual Question Answering (KB-VQA) performance. This method decouples entity identification from evidence ranking, ad…

  2. RESEARCH · CL_105257

    New benchmarks and methods tackle visual document retrieval challenges

    Researchers have developed new methods to improve visual document retrieval, particularly for large collections of similar documents like invoices. One approach, Invoice Haystack, introduces a benchmark designed to stre…

  3. RESEARCH · CL_84430

    New TASM framework boosts MLLM efficiency with structured memory

    Researchers have developed a new framework called TASM (Task-Aware Structured Memory) to improve the efficiency of multi-modal large language models (MLLMs). This training-free approach addresses the limitations of curr…

  4. RESEARCH · CL_79694

    New benchmarks and frameworks enhance video temporal grounding

    Researchers have introduced new benchmarks and frameworks for improving temporal grounding in long-form videos. One study posits that hour-scale video grounding is primarily a search problem, not a recognition one, and …

  5. RESEARCH · CL_79606

    LLM privacy research tackles Japanese data, multi-modal risks, and DP adaptation

    Researchers are exploring privacy risks associated with large language models (LLMs) and their adaptations. One study focuses on detecting sensitive personal information in Japanese pre-training corpora, developing a cl…

  6. TOOL · CL_65341

    Survey details LLM and MM-LLM use in transportation operations

    A new survey paper explores the application of large language models (LLMs) and multi-modal large language models (MM-LLMs) in transportation systems management and operations. The research synthesizes current studies a…

  7. RESEARCH · CL_36921

    AI agents learn human beliefs and spatial reasoning

    Researchers are exploring how AI agents can better understand human beliefs and intentions, particularly in interactive scenarios. One paper proposes a second-order Theory of Mind (ToM-2) framework using I-POMDP to enab…

  8. RESEARCH · CL_27982

    AI research questions video anomaly detection framing

    Two new research papers challenge the current direction of video anomaly detection (VAD). The first paper argues that the field's focus on general models and multi-modal large language models (MLLMs) has shifted focus a…