Author of the publication

BubbleRank: Safe Online Learning to Re-Rank via Implicit Click Feedback.

UAI, volume 115 of Proceedings of Machine Learning Research, pages 196-206. AUAI Press, (2019)

Other publications of authors with the same name

Exploiting Symmetries to Construct Efficient MCMC Algorithms With an Application to SLAM. AISTATS, volume 38 of JMLR Workshop and Conference Proceedings, JMLR.org, (2015)
Deterministic Independent Component Analysis. ICML, volume 37 of JMLR Workshop and Conference Proceedings, pages 2521-2530. JMLR.org, (2015)
Online Learning under Delayed Feedback. ICML (3), volume 28 of JMLR Workshop and Conference Proceedings, pages 1453-1461. JMLR.org, (2013)
On Identifying Good Options under Combinatorially Structured Feedback in Finite Noisy Environments. ICML, volume 37 of JMLR Workshop and Conference Proceedings, pages 1283-1291. JMLR.org, (2015)
A Modular Analysis of Adaptive (Non-)Convex Optimization: Optimism, Composite Objectives, and Variational Bounds. ALT, volume 76 of Proceedings of Machine Learning Research, pages 681-720. PMLR, (2017)
Differentiable Meta-Learning in Contextual Bandits. CoRR, (2020)
A Randomized Strategy for Learning to Combine Many Features. CoRR, (2012)
Exponential Lower Bounds for Planning in MDPs With Linearly-Realizable Optimal Action-Value Functions. CoRR, (2020)
Online Markov Decision Processes Under Bandit Feedback. IEEE Trans. Autom. Control., 59 (3): 676-691 (2014)
Training parsers by inverse reinforcement learning. Mach. Learn., 77 (2-3): 303-337 (2009)