Arrow Research search
Back to AAAI

AAAI 2016

Algorithms for Differentially Private Multi-Armed Bandits

Conference Paper Papers Artificial Intelligence

Abstract

We present differentially private algorithms for the stochastic Multi-Armed Bandit (MAB) problem. This is a problem for applications such as adaptive clinical trials, experiment design, and user-targeted advertising where private information is connected to individual rewards. Our major contribution is to show that there exist (, δ) differentially private variants of Upper Confidence Bound algorithms which have optimal regret, O( −1 + log T). This is a significant improvement over previous results, which only achieve poly-log regret O( −1 log3 T), because of our use of a novel intervalbased mechanism. We also substantially improve the bounds of previous family of algorithms which use a continual release mechanism. Experiments clearly validate our theoretical bounds.

Authors

Keywords

No keywords are indexed for this paper.

Context

Venue
AAAI Conference on Artificial Intelligence
Archive span
1980-2026
Indexed papers
28718
Paper id
136194185678220022