LLaVA-1.5-7B
PulseAugur coverage of LLaVA-1.5-7B — every cluster mentioning LLaVA-1.5-7B across labs, papers, and developer communities, ranked by signal.
1 day(s) with sentiment data
-
Apple researchers balance image captioning with new RL framework
Apple researchers have developed BalCapRL, a new framework for reinforcement learning-based image captioning using multimodal large language models. This approach aims to balance multiple caption quality dimensions, inc…
-
New framework uses foundation models for car interior object detection
Researchers have developed a novel framework called ODAL for object detection and localization within car interiors, designed to overcome the computational limitations of in-vehicle systems. This framework splits proces…
-
Researchers analyze metric unreliability in multimodal machine unlearning
Researchers have identified significant unreliability in current evaluation metrics for machine unlearning in Vision-Language Models (VLMs). Analysis of 36 unlearned LLaVA-1.5-7B models revealed that standard metrics li…