PulseAugur
EN
LIVE 13:59:23
ENTITY LLaVA-1.5-7B

LLaVA-1.5-7B

PulseAugur coverage of LLaVA-1.5-7B — every cluster mentioning LLaVA-1.5-7B across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
7
7 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
7
7 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

3 day(s) with sentiment data

RECENT · PAGE 1/1 · 7 TOTAL
  1. TOOL · CL_114371 ·

    Vision-Language Models struggle with classroom engagement recognition

    A new benchmark study evaluated five Vision-Language Models (VLMs) for their ability to recognize classroom engagement in zero-shot settings. The models, including GPT-4o and LLaVA-1.5-7B, performed poorly on individual…

  2. RESEARCH · CL_79207 ·

    New pruning techniques promise smaller models and faster training

    Researchers have developed new methods for pruning neural networks and datasets to improve efficiency. DCP-Prune focuses on ultra-low token pruning for vision models, achieving high performance with significantly fewer …

  3. RESEARCH · CL_65854 ·

    New methods drastically cut VLM visual tokens, boosting efficiency

    Researchers have developed three new methods to significantly compress the visual tokens used by large vision-language models (VLMs), aiming to reduce computational overhead and improve inference speed. InfoMerge uses t…

  4. TOOL · CL_56496 ·

    AI fine-tuned for bridge damage assessment and repair priority scoring

    Researchers have developed a method to automate bridge damage assessment and repair priority scoring using fine-tuned Vision-Language Models (VLMs). By training LLaVA-1.5-7B with a curated dataset of bridge images and i…

  5. TOOL · CL_27337 ·

    Apple researchers balance image captioning with new RL framework

    Apple researchers have developed BalCapRL, a new framework for reinforcement learning-based image captioning using multimodal large language models. This approach aims to balance multiple caption quality dimensions, inc…

  6. TOOL · CL_20781 ·

    New framework uses foundation models for car interior object detection

    Researchers have developed a novel framework called ODAL for object detection and localization within car interiors, designed to overcome the computational limitations of in-vehicle systems. This framework splits proces…

  7. RESEARCH · CL_15553 ·

    Researchers analyze metric unreliability in multimodal machine unlearning

    Researchers have identified significant unreliability in current evaluation metrics for machine unlearning in Vision-Language Models (VLMs). Analysis of 36 unlearned LLaVA-1.5-7B models revealed that standard metrics li…