VQA v2
PulseAugur coverage of VQA v2 — every cluster mentioning VQA v2 across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
New ASR method prevents multimodal LLMs from forgetting skills
Researchers have introduced Attention-Spectrum Regularization (ASR), a novel framework designed to prevent multimodal large language models (MLLMs) from forgetting previously learned skills when adapting to new data. AS…
-
ITNet architecture unifies convolution, attention, and recurrence
Researchers have introduced ITNet, a novel neural network architecture that unifies convolution, attention, and recurrence into a single learnable integral transform. This architecture uses a learnable kernel, implement…
-
New research tackles continual learning in LLMs with novel MoE methods
Two new research papers propose novel approaches to continual learning in large language and vision-language models, aiming to mitigate catastrophic forgetting. CP-MoE introduces a transient expert to guide updates and …
-
Researchers unveil new stealthy backdoor attacks on AI models using diffusion and style features
Researchers have developed new methods for backdoor attacks on advanced AI models, specifically targeting Vision-Language Models (VLMs) and Diffusion Models (DMs). One approach, CBV, uses diffusion models to create natu…