HealthBench
PulseAugur coverage of HealthBench — every cluster mentioning HealthBench across labs, papers, and developer communities, ranked by signal.
1 day(s) with sentiment data
-
LLMs learn to actively seek external info for better task adaptation
Researchers have developed a new method for adapting large language models (LLMs) by enabling them to actively seek information from external sources like Wikipedia and web browsers. This approach, termed "active inform…
-
Apple's RVPO framework enhances LLM alignment by penalizing reward variance
Researchers have introduced Reward-Variance Policy Optimization (RVPO), a novel framework designed to improve the alignment of large language models with multiple objectives. Unlike existing methods that average rewards…
-
TheraAgent AI improves medical treatment planning with iterative refinement
Researchers have developed TheraAgent, a new framework designed to improve the precision and safety of treatment plans generated by large language models. Unlike traditional one-shot generation, TheraAgent employs an it…