New method uses VAEs and Mahalanobis distance for OOD detection in RL control

By PulseAugur Editorial · [1 sources] · 2026-06-11 04:00

Researchers have developed a novel method for detecting out-of-distribution (OOD) observations in time-varying systems, particularly for safety-critical applications like particle accelerator control. The approach utilizes a Variational Autoencoder (VAE) trained on normal operational data to identify unseen scenarios. By measuring the Mahalanobis distance in the VAE's latent space, the system can accurately distinguish between in-distribution and OOD states, enabling a seamless switch between a fast Reinforcement Learning (RL) controller and a robust extremum seeking (ES) controller. AI

RANK_REASON Academic paper detailing a new method for OOD detection in control systems. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

paper
safety

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.LG TIER_1 English(EN) · Shaifalee Saxena, Alexander Scheinker · 2026-06-11 04:00

Mahalanobis-Guided Latent OOD Detection for Hybrid ES-DRL Control in Time-Varying Systems

arXiv:2606.11474v1 Announce Type: new Abstract: In this paper, we study Mahalanobis-guided latent out-of-distribution (OOD) detection for test-time RL controller switching in nonlinear time-varying systems. RL controllers can quickly control high-dimensional systems within the tr…

COVERAGE [1]

Mahalanobis-Guided Latent OOD Detection for Hybrid ES-DRL Control in Time-Varying Systems

RELATED ENTITIES

RELATED TOPICS