DeepMind Control Suite
PulseAugur coverage of DeepMind Control Suite — every cluster mentioning DeepMind Control Suite across labs, papers, and developer communities, ranked by signal.
-
Researchers fix synthetic data failures in reinforcement learning policy optimization
Researchers have identified and addressed algorithmic failures in Model-Based Policy Optimization (MBPO), a technique used in reinforcement learning. The study found that MBPO can underperform compared to other methods …
-
HaM-World model enhances AI planning with selective memory and Hamiltonian dynamics
Researchers have introduced HaM-World, a novel structured world model designed to improve the stability and accuracy of planning in reinforcement learning. This model decomposes latent states into canonical (q, p) and c…
-
ELVIS: Ensemble-Calibrated Latent Imagination for Long-Horizon Visual MPC
Researchers have developed ELVIS, a novel approach to long-horizon visual planning in reinforcement learning that uses a Gaussian-mixture model predictive controller to maintain multiple hypotheses over extended rollout…