Author of the publication

BubbleRank: Safe Online Learning to Re-Rank via Implicit Click Feedback.

C. Li, B. Kveton, T. Lattimore, I. Markov, M. de Rijke, C. Szepesvári, and M. Zoghi. UAI, volume 115 of Proceedings of Machine Learning Research, page 196-206. AUAI Press, (2019)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

Gergely Csaba

Csaba Lovász

Csaba Szinyei

Csaba Dávid

Csaba Miskey

Other publications of authors with the same name

Exploiting Symmetries to Construct Efficient MCMC Algorithms With an Application to SLAM. R. Shariff, A. György, and C. Szepesvári. AISTATS, volume 38 of JMLR Workshop and Conference Proceedings, JMLR.org, (2015)

Deterministic Independent Component Analysis. R. Huang, A. György, and C. Szepesvári. ICML, volume 37 of JMLR Workshop and Conference Proceedings, page 2521-2530. JMLR.org, (2015)

Online Learning under Delayed Feedback. P. Joulani, A. György, and C. Szepesvári. ICML (3), volume 28 of JMLR Workshop and Conference Proceedings, page 1453-1461. JMLR.org, (2013)

On Identifying Good Options under Combinatorially Structured Feedback in Finite Noisy Environments. Y. Wu, A. György, and C. Szepesvári. ICML, volume 37 of JMLR Workshop and Conference Proceedings, page 1283-1291. JMLR.org, (2015)

A Modular Analysis of Adaptive (Non-)Convex Optimization: Optimism, Composite Objectives, and Variational Bounds. P. Joulani, A. György, and C. Szepesvári. ALT, volume 76 of Proceedings of Machine Learning Research, page 681-720. PMLR, (2017)

Differentiable Meta-Learning in Contextual Bandits. B. Kveton, M. Mladenov, C. Hsu, M. Zaheer, C. Szepesvári, and C. Boutilier. CoRR, (2020)

A Randomized Strategy for Learning to Combine Many Features. A. Afkanpour, A. György, C. Szepesvári, and M. Bowling. CoRR, (2012)

Exponential Lower Bounds for Planning in MDPs With Linearly-Realizable Optimal Action-Value Functions. G. Weisz, P. Amortila, and C. Szepesvári. CoRR, (2020)

Online Markov Decision Processes Under Bandit Feedback. G. Neu, A. György, C. Szepesvári, and A. Antos. IEEE Trans. Autom. Control., 59 (3): 676-691 (2014)

Training parsers by inverse reinforcement learning. G. Neu, and C. Szepesvári. Mach. Learn., 77 (2-3): 303-337 (2009)

Disambiguation of "Szepesvári, Csaba"
