Researchers use RL to improve MLLM regression on imbalanced data

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-05 04:00

Researchers have developed a new framework to improve how multimodal large language models (MLLMs) handle numerical regression tasks, particularly those with imbalanced data distributions. Existing training methods often lead to poor performance on less frequent data points. The proposed solution uses a distribution-aware reinforcement learning approach with batch-level supervision to better align predicted and actual data distributions. Experiments demonstrated significant improvements over standard fine-tuning methods, especially in scenarios with limited training examples. AI

影响 Introduces a novel training paradigm for MLLMs to enhance performance on imbalanced regression datasets, potentially improving their utility in real-world applications with skewed data.

排序理由 This is a research paper detailing a new framework for improving MLLM performance on specific regression tasks. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Yao Du, Shanshan Li, Xiaomeng Li · 2026-05-05 04:00

Injecting Distributional Awareness into MLLMs via Reinforcement Learning for Deep Imbalanced Regression

arXiv:2605.01402v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) struggle with numerical regression under long-tailed target distributions. Token-level supervised fine-tuning (SFT) and point-wise regression rewards bias learning toward high-density regio…

报道来源 [1]

Injecting Distributional Awareness into MLLMs via Reinforcement Learning for Deep Imbalanced Regression

相关实体

相关话题