MICA framework enhances LLM emotional support dialogues with novel RL approach

By PulseAugur Editorial · [1 sources] · 2026-05-06 04:00

Researchers have introduced MICA, a novel reinforcement learning framework designed to improve the performance of large language models in multi-turn emotional support dialogues. This critic-free approach addresses challenges like sparse rewards and poor credit assignment by deriving both immediate and delayed credit from a shared potential function. MICA utilizes an Incremental Distance Reward for per-turn optimization and its Monte Carlo return for delayed effects, demonstrating significant improvements on benchmarks like EMPA, EQ-Bench, and EmoBench when tested with Qwen models. AI

IMPACT Introduces a new RL framework that could enhance the capabilities of conversational AI in complex, multi-turn interactions.

RANK_REASON This is a research paper detailing a new framework for reinforcement learning in LLMs. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

MICA framework enhances LLM emotional support dialogues with novel RL approach

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · Naifan Zhang, Ruihan Sun, Jinwei Su, Hengjie Yang, Zhengyuan Pan, Zhaohan Chen, Xiaofan Zhang · 2026-05-06 04:00

MICA: Multi-granularity Intertemporal Credit Assignment for Long-Horizon Emotional Support Dialogue

arXiv:2603.06194v2 Announce Type: replace Abstract: Reinforcement learning (RL) for large language models (LLMs) has shown strong performance in single-turn tasks, but extending it to multi-turn interaction remains challenging due to sparse rewards and poor per-turn credit assign…

COVERAGE [1]

MICA: Multi-granularity Intertemporal Credit Assignment for Long-Horizon Emotional Support Dialogue

RELATED ENTITIES

RELATED TOPICS