Apple tasting problem regret bound found to be \u221aT

By PulseAugur Editorial · [2 sources] · 2026-06-02 16:28

Researchers have analyzed the "two-action apple-tasting problem" with switching costs, a scenario relevant to machine learning algorithms. They found that the expected regret for this problem is bounded by $\sqrt{T}$, which is better than the previously assumed $\widetilde O(T^{2/3})$ bound. This finding removes a potential obstruction in the classification of feedback-graph algorithms. AI

IMPACT Establishes a tighter theoretical bound for a class of learning algorithms, potentially influencing future algorithm design.

RANK_REASON The cluster contains an academic paper detailing a theoretical result in machine learning.

Read on arXiv cs.LG →

paper
other

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

arXiv cs.LG TIER_1 English(EN) · Tommaso Cesari, Roberto Colomboni · 2026-06-03 04:00

Two-Action Apple Tasting with Switching Costs

arXiv:2606.03851v1 Announce Type: new Abstract: We study the two-action apple-tasting problem with switching costs against an oblivious adversary. In an equivalent normalized formulation, at each round the learner chooses between a revealing action and a blind action: the reveali…
arXiv cs.LG TIER_1 English(EN) · Roberto Colomboni · 2026-06-02 16:28

Two-Action Apple Tasting with Switching Costs

We study the two-action apple-tasting problem with switching costs against an oblivious adversary. In an equivalent normalized formulation, at each round the learner chooses between a revealing action and a blind action: the revealing action gives reward $0$ and reveals the hidde…

COVERAGE [2]

Two-Action Apple Tasting with Switching Costs

Two-Action Apple Tasting with Switching Costs

RELATED ENTITIES

RELATED TOPICS