New TSMC method optimizes trajectories and policies with differentiable dynamics

By PulseAugur Editorial · [2 sources] · 2026-04-23 09:13

Researchers have introduced Tempered Sequential Monte Carlo (TSMC), a sampling-based framework for finite-horizon trajectory and policy optimization in systems with differentiable dynamics. The approach casts controller design as an inference problem: it minimizes a KL-regularized expected trajectory cost. To sample from the resulting target distribution, TSMC uses an annealing scheme that adaptively reweights and resamples particles along a tempering path. The authors report broad applicability and better performance than existing baselines on their benchmarks.
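The reweight, resample, and move loop behind a tempered sampler of this kind can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the paper's algorithm: the toy dynamics `x_{t+1} = x_t + u_t`, the Gaussian prior over controls, the temperature `lam`, the ESS-based bisection rule for choosing the next tempering level, and the random-walk Metropolis move are all hypothetical choices made here. In particular, the sketch uses no gradients, whereas the method is stated for differentiable dynamics.

```python
import math
import random


def rollout_cost(controls, x0=0.0, goal=1.0, control_weight=0.1):
    """Trajectory cost under the assumed toy dynamics x_{t+1} = x_t + u_t."""
    x, cost = x0, 0.0
    for u in controls:
        x += u
        cost += control_weight * u * u
    return cost + (x - goal) ** 2


def log_prior(controls):
    """Unnormalized log density of the assumed N(0, I) prior over controls."""
    return -0.5 * sum(u * u for u in controls)


def ess(log_w):
    """Effective sample size of a list of unnormalized log weights."""
    m = max(log_w)
    w = [math.exp(v - m) for v in log_w]
    return sum(w) ** 2 / sum(v * v for v in w)


def next_beta(costs, beta, lam, target_ess):
    """Bisect for the largest next level (at most 1) that keeps ESS >= target."""
    def ess_at(b):
        return ess([-(b - beta) * c / lam for c in costs])
    if ess_at(1.0) >= target_ess:
        return 1.0
    lo, hi = beta, 1.0
    for _ in range(50):
        mid = 0.5 * (lo + hi)
        if ess_at(mid) >= target_ess:
            lo = mid
        else:
            hi = mid
    return lo


def tempered_smc(num_particles=256, horizon=5, lam=0.05, ess_frac=0.5,
                 mcmc_steps=5, step_size=0.2, max_stages=200, seed=0):
    """Anneal particles from the prior (beta=0) to the tilted target (beta=1)."""
    rng = random.Random(seed)
    particles = [[rng.gauss(0.0, 1.0) for _ in range(horizon)]
                 for _ in range(num_particles)]
    beta, betas = 0.0, [0.0]
    while beta < 1.0 and len(betas) <= max_stages:
        costs = [rollout_cost(p) for p in particles]
        new_beta = next_beta(costs, beta, lam, ess_frac * num_particles)
        # Reweight: incremental weight exp(-(new_beta - beta) * cost / lam).
        log_w = [-(new_beta - beta) * c / lam for c in costs]
        m = max(log_w)
        weights = [math.exp(v - m) for v in log_w]
        # Resample: multinomial resampling proportional to the weights.
        chosen = rng.choices(particles, weights=weights, k=num_particles)
        particles = [list(p) for p in chosen]
        beta = new_beta
        betas.append(beta)

        def log_target(u):
            return log_prior(u) - beta * rollout_cost(u) / lam

        # Move: a few random-walk Metropolis steps targeting the current level.
        for i, current in enumerate(particles):
            cur_lp = log_target(current)
            for _ in range(mcmc_steps):
                proposal = [u + step_size * rng.gauss(0.0, 1.0) for u in current]
                prop_lp = log_target(proposal)
                if rng.random() < math.exp(min(0.0, prop_lp - cur_lp)):
                    current, cur_lp = proposal, prop_lp
            particles[i] = current
    return particles, betas


if __name__ == "__main__":
    final_particles, schedule = tempered_smc()
    mean_cost = sum(rollout_cost(p) for p in final_particles) / len(final_particles)
    print(f"stages: {len(schedule) - 1}, mean final cost: {mean_cost:.3f}")
```

Choosing each tempering level by bisection on the effective sample size is one common way to make the schedule adaptive: early steps are small while costs vary widely across particles, and they grow as the population concentrates on low-cost control sequences.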

IMPACT Introduces a new optimization technique that could improve performance in robotics and control systems.

RANK_REASON This is a research paper describing a new method for trajectory and policy optimization.


AI-generated summary · Google Gemini · from 2 sources.

COVERAGE [2]

Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-23 09:13

Tempered Sequential Monte Carlo for Trajectory and Policy Optimization with Differentiable Dynamics

We propose a sampling-based framework for finite-horizon trajectory and policy optimization under differentiable dynamics by casting controller design as inference. Specifically, we minimize a KL-regularized expected trajectory cost, which yields an optimal "Boltzmann-tilted" dis…
arXiv cs.LG TIER_1 English(EN) · Heng Yang · 2026-04-23 09:13

Tempered Sequential Monte Carlo for Trajectory and Policy Optimization with Differentiable Dynamics

We propose a sampling-based framework for finite-horizon trajectory and policy optimization under differentiable dynamics by casting controller design as inference. Specifically, we minimize a KL-regularized expected trajectory cost, which yields an optimal "Boltzmann-tilted" dis…
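
For context, a KL-regularized expected-cost objective of the kind the abstract describes has a well-known closed-form minimizer. The notation below (trajectory \tau, cost C, reference distribution p, temperature \lambda, tempering level \beta) is assumed here for illustration and may differ from the paper's; the tempering path shown is one standard choice, not necessarily the authors'.

```latex
\min_{q}\; \mathbb{E}_{\tau \sim q}\!\left[ C(\tau) \right]
  + \lambda\, \mathrm{KL}\!\left( q \,\|\, p \right)
\quad\Longrightarrow\quad
q^{*}(\tau) =
  \frac{p(\tau)\, \exp\!\left( -C(\tau)/\lambda \right)}
       {\mathbb{E}_{\tau' \sim p}\!\left[ \exp\!\left( -C(\tau')/\lambda \right) \right]},
\qquad
\pi_{\beta}(\tau) \propto p(\tau)\, \exp\!\left( -\beta\, C(\tau)/\lambda \right),
\quad \beta \in [0, 1].
```

At \beta = 0 the path starts at the reference distribution p, and at \beta = 1 it reaches the tilted optimum q^{*}.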
