Inproceedings,

Online Learning with Gaussian Payoffs and Side Observations

Y. Wu, A. György, and {. Szepesvári.
NIPS, page 1360--1368. (September 2015)

Abstract

We consider a sequential learning problem with Gaussian payoffs and side observations: after selecting an action i, the learner receives information about the payoff of every action j in the form of Gaussian observations whose mean is the same as the mean payoff, but the variance depends on the pair (i,j) (and may be infinite). The setup allows a more refined information transfer from one action to another than previous partial monitoring setups, including the recently introduced graph-structured feedback case. For the first time in the literature, we provide non-asymptotic problem-dependent lower bounds on the regret of any algorithm, which recover existing asymptotic problem-dependent lower bounds and finite-time minimax lower bounds available in the literature. We also provide algorithms that achieve the problem-dependent lower bound (up to some universal constant factor) or the minimax lower bounds (up to logarithmic factors).

BibTeX key: WGySz:NIPS15
entry type: inproceedings
booktitle: NIPS
year: 2015
month: September
pages: 1360--1368
pdf: papers/NIPS15-SideObs.pdf
date-modified: 2016-08-16 23:22:40 +0000
date-added: 2015-12-02 01:36:54 +0000

BibSonomy

Online Learning with Gaussian Payoffs and Side Observations

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on