Author of the publication

Finite Time Bounds for Temporal Difference Learning with Function Approximation: Problems with some ``state-of-the-art'' results

, and . (August 2017)Technical Report.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Uncertainty and Performance of Adaptive Controllers for Functionally Uncertain Output Feedback Systems, , and . CDC, page 4515--4520. Tampa, Florida, IEEE, (December 1998)Integration of ANN Cues, Dynamic AI Concepts and ANN Decision System into an Adaptive Self-Organizing Agent, and . 3rd Conf. on Artificial Intelligence, page 231--237. Budapest, Hungary, John von Neumann Society for Computer Sciences, (April 1993)Parallel and Robust Skeletonization Built on Self-organizing Elements, , , and . Neural Networks, (1999)Regularized Fitted Q-iteration for Planning in Continuous-Space Markovian Decision Problems, , , and . ACC, page 725--730. (2009)The Online Loop-free Stochastic Shortest-Path Problem, , and . COLT, page 231--243. (June 2010)Empirical Bernstein stopping, , and . ICML, page 672--679. (2008)REGO: Rank-based Estimation of Rényi Information using Euclidean Graph Optimization, , and . AISTATS, 9, page 852--859. (May 2010)Budgeted Distribution Learning of Belief Net Parameters, , , and . ICML, page 879--886. Omnipress, (June 2010)Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms, , , and . Machine Learning, 38 (3): 287--308 (2000)Learning Near-optimal Policies with Bellman-residual Minimization based Fitted Policy Iteration and a Single Sample Path, , and . COLT, page 574--588. (2006)