ENTITY SelectiveRM

SelectiveRM

PulseAugur coverage of SelectiveRM — every cluster mentioning SelectiveRM across labs, papers, and developer communities, ranked by signal.

Total · 30d

2

2 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

2

2 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 2 TOTAL

RESEARCH · CL_91716 · Jun 15 · 07:39

SelectiveRM framework trains reward models to ignore noisy preferences

Researchers from Zhejiang University, Xiaohongshu, and Peking University have developed SelectiveRM, a novel framework for training reward models in large language models. This method addresses the issue of noisy prefer…
RESEARCH · CL_15878 · May 3 · 11:45

New research explores advanced reward modeling for LLMs and diffusion models

Several new research papers explore advancements in reward modeling for AI alignment, particularly for large language models and diffusion models. One paper introduces SelectiveRM, a framework using optimal transport to…