Researchers have developed AdaPrivate-TS, a new differentially private contextual bandit algorithm that integrates Thompson Sampling with batched zCDP composition. This approach interprets the added Gaussian noise as increased uncertainty, leading to improved performance and privacy guarantees. Experiments on various datasets show AdaPrivate-TS achieves high percentages of non-private performance at different privacy budgets and outperforms other baselines, particularly when privacy amplification is applied. AI
IMPACT Enhances privacy in reinforcement learning applications, potentially enabling more sensitive data use in personalized systems.
RANK_REASON The cluster contains a research paper detailing a new algorithm for contextual bandits with differential privacy. [lever_c_demoted from research: ic=1 ai=1.0]
- AdaPrivate-TS
- DP-SVD
- Gaussian noise
- jester
- Mohammadreza Riyazat
- MovieLens
- Thompson sampling
- University of California, Berkeley
- zCDP
AI-generated summary · Google Gemini · from 3 sources. How we write summaries →