Algorithms for Differentially Private Multi-Armed Bandits

Aristide Tossou; Christos Dimitrakakis

Back to AAAI

AAAI 2016

Algorithms for Differentially Private Multi-Armed Bandits

Conference Paper Papers Artificial Intelligence

PDF Details

Abstract

We present differentially private algorithms for the stochastic Multi-Armed Bandit (MAB) problem. This is a problem for applications such as adaptive clinical trials, experiment design, and user-targeted advertising where private information is connected to individual rewards. Our major contribution is to show that there exist (, δ) differentially private variants of Upper Conﬁdence Bound algorithms which have optimal regret, O( −1 + log T). This is a signiﬁcant improvement over previous results, which only achieve poly-log regret O( −1 log3 T), because of our use of a novel intervalbased mechanism. We also substantially improve the bounds of previous family of algorithms which use a continual release mechanism. Experiments clearly validate our theoretical bounds.

Authors

Keywords

No keywords are indexed for this paper.

Context

Venue: AAAI Conference on Artificial Intelligence
Archive span: 1980-2026
Indexed papers: 28718
Paper id: 136194185678220022