@kirk86

Minimax Regret for Bandit Convex Optimisation of Ridge Functions

. (2021)cite arxiv:2106.00444Comment: Correcting an (instructive) error that leads to a weaker result.

Abstract

We analyse adversarial bandit convex optimisation with an adversary that is restricted to playing functions of the form $f_t(x) = g_t(x, þeta\rangle)$ for convex $g_t : R R$ and unknown $þeta R^d$ that is homogeneous over time. We provide a short information-theoretic proof that the minimax regret is at most $O(d n łog(n diam(K)))$ where $n$ is the number of interactions, $d$ the dimension and $diam(K)$ is the diameter of the constraint set.

Description

[2106.00444] Minimax Regret for Bandit Convex Optimisation of Ridge Functions

Links and resources

Tags

community

  • @kirk86
  • @dblp
@kirk86's tags highlighted