Brief

last 24h

[2/2] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.LG English(EN) · 7h

Sharp analysis of linear ensemble sampling

Researchers have published a new analysis of linear ensemble sampling (ES) in stochastic linear bandits, demonstrating its effectiveness with standard Gaussian perturbations. The study shows that ES can achieve a regret of \tilde O(d^{3/2}\sqrt n) with an ensemble size of m=\Theta(d\log n), matching the performance of Thompson sampling while maintaining comparable computational costs. The novel proof technique involves reducing the analysis to a time-uniform exceedance problem for independent Brownian motions, offering a new perspective on randomized exploration in linear bandits. AI
RESEARCH · arXiv cs.LG English(EN) · 23h · [2 sources]

Near-Optimal Stochastic Linear Bandits with Delay

Researchers have published a paper detailing near-optimal regret guarantees for stochastic linear bandits with delayed feedback. The study distinguishes between loss-independent and loss-dependent delays, finding that the former incurs only an additive penalty that is dimension-free. In contrast, loss-dependent delays present greater challenges, with penalties scaling with the square root of the dimension, making them significantly harder than in multi-armed bandit scenarios. AI

Sharp analysis of linear ensemble sampling