English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  PAC-Bayesian Analysis of Contextual Bandits

Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J., & Ortner, R. (2012). PAC-Bayesian Analysis of Contextual Bandits. In J. Shaw-Taylor, R. Zemel, P. Bartlett, F. Pereira, & K. Weinberger (Eds.), Advances in Neural Information Processing Systems 24 (pp. 1683-1691). Red Hook, NY, USA: Curran.

Item is

Files

show Files

Locators

show
hide
Description:
-
OA-Status:

Creators

show
hide
 Creators:
Seldin, Y1, Author           
Auer, P, Author
Laviolette, F, Author
Shawe-Taylor, J, Author
Ortner, R, Author
Affiliations:
1Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society, DE, ou_1497647              

Content

show
hide
Free keywords: -
 Abstract: We derive an instantaneous (per-round) data-dependent regret bound for stochastic multiarmed bandits with side information (also known as contextual bandits). The scaling of our regret bound with the number of states (contexts) N goes as \sqrtN I_{ρ_t}(S;A)}, where I_{ρ_t}(S;A) is the mutual information between states and actions (the side information) used by the algorithm at round t. If the algorithm uses all the side information, the regret bound scales as \sqrt{N \ln K}, where K is the number of actions (arms). However, if the side information I_{ρ_t}(S;A) is not fully used, the regret bound is significantly tighter. In the extreme case, when I_{ρ_t(S;A) = 0, the dependence on the number of states reduces from linear to logarithmic. Our analysis allows to provide the algorithm large amount of side information, let the algorithm to decide which side information is relevant for the task, and penalize the algorithm only for the side information that it is using de facto. We also present an algorithm for multiarmed bandits with side information with computational complexity that is a linear in the number of actions.

Details

show
hide
Language(s):
 Dates: 2012-01
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: BibTex Citekey: SeldinALSO2011
 Degree: -

Event

show
hide
Title: Twenty-Fifth Annual Conference on Neural Information Processing Systems (NIPS 2011)
Place of Event: Granada, Spain
Start-/End Date: -

Legal Case

show

Project information

show

Source 1

show
hide
Title: Advances in Neural Information Processing Systems 24
Source Genre: Proceedings
 Creator(s):
Shaw-Taylor, J, Editor
Zemel, RS, Editor
Bartlett, P, Editor
Pereira, F, Editor
Weinberger, KQ, Editor
Affiliations:
-
Publ. Info: Red Hook, NY, USA : Curran
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 1683 - 1691 Identifier: ISBN: 978-1-618-39599-3