LLaVA-665K
PulseAugur coverage of LLaVA-665K — every cluster mentioning LLaVA-665K across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
VisNec framework boosts multimodal AI tuning by selecting essential visual data
Researchers have developed VisNec, a framework to measure and leverage visual necessity in multimodal instruction tuning. This method identifies training samples that genuinely require visual reasoning, filtering out re…
-
AI agents can automate data curation, but need structured guidance
Researchers have developed Curation-Bench, a new benchmark designed to evaluate the ability of generalist coding agents to automate the data curation process for AI model training. Initial tests show that agents can per…
-
New frameworks enhance multimodal LLM tuning and efficiency
Researchers have introduced two new frameworks to improve multimodal instruction tuning for large language models. The SAME framework addresses issues of "router drift" and "expert drift" in continual learning by stabil…