ENTITY HalfCheetah-v4

HalfCheetah-v4

PulseAugur coverage of HalfCheetah-v4 — every cluster mentioning HalfCheetah-v4 across labs, papers, and developer communities, ranked by signal.

Total · 30d

2

2 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

2

2 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 2 TOTAL

RESEARCH · CL_99607 · Jun 18 · 00:00

New research explores RL efficiency, reward-free control, and safe navigation

Researchers are exploring novel approaches in reinforcement learning (RL) to enhance efficiency and performance across various domains. One study investigates the "rollout infrastructure tax" in coding-agent RL, reveali…
TOOL · CL_21988 · May 8 · 04:00

New Pair-GRPO algorithms enhance LLM alignment stability and generalization

Researchers have introduced the Pair-GRPO family, a novel theoretical framework designed to enhance the stability and generality of reinforcement learning for aligning large language models. This family includes two var…