Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some of the publications already assigned to that person.

 

Other publications of authors with the same name

Behavior of an Adaptive Self-organizing Autonomous Agent Working with Cues and Competing Concepts. Adaptive Behavior, 2 (2): 131--160 (1994)

LMS-2: Towards an Algorithm that is as Cheap as LMS and Almost as Efficient as RLS. CDC, page 1181--1188. (2009)

Randomized Exploration in Generalized Linear Bandits. AISTATS, (March 2020)

Mixing Time Estimation in Reversible Markov Chains from a Single Sample Path. Annals of Applied Probability, 29 (4): 2439--2480 (July 2019)

Deterministic Independent Component Analysis. ICML, page 2521--2530. (2015)

Partial monitoring -- classification, regret bounds, and algorithms. Mathematics of Operations Research, (2014)

Policy Error Bounds for Model-Based Reinforcement Learning with Factored Linear Models. COLT, page 121--151. (2016)

Error Propagation for Approximate Policy and Value Iteration (extended version). NIPS, (December 2010)

Cleaning up the neighborhood: A full classification for adversarial partial monitoring. ALT, (February 2019)

Toward Minimax Off-policy Value Estimation. AISTATS, page 608--616. (2015)