Researchers have introduced AdaJudge, a novel framework designed to enhance the accuracy of reward modeling in large language models. This approach tackles limitations in current static pooling strategies by adapting both the model's representations and its aggregation methods. AdaJudge employs gated refinement blocks to create discrimination-oriented representations and an adaptive multi-view pooling module for dynamic evidence combination. Experiments on RM-Bench and JudgeBench demonstrate AdaJudge's superior performance compared to existing reward models and pooling baselines. AI
IMPACT Enhances LLM alignment by improving reward modeling, potentially leading to more nuanced and human-aligned AI behavior.
RANK_REASON This is a research paper detailing a new method for reward modeling in LLMs. [lever_c_demoted from research: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →