![]() |
CiteULike | ![]() |
mdreid's CiteULike | ![]() |
![]() |
|
![]() |
Register | ![]() |
Log in | ![]() |
An Efficient Algorithm for Bandit Linear Optimization |
Reviews
[Write a review of this article]
Notes for this articleThis tech report is a slight longer version of the authors subsequent COLT 2008 submission "Competing in the Dark: An Efficient Algorithm for Bandit Linear Optimization".
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
Posting History
AbstractWe introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O∗ (√T ) regret. The setting is a natural generalization of the non-stochastic multi-armed bandit problem, and the existence of an efficient optimal algorithm has been posed as an open problem in a number of recent papers. We show how the difficulties encountered by previous approaches are overcome by the use of a self-concordant potential function. Our approach presents a novel connection between online learning and interior point methods.
BibTeX record
RIS record