Finite Time Bounds for Temporal Difference Learning with Function Approximation: Problems with some ``state-of-the-art'' results

C. Lakshminarayanan and Cs. Szepesvári. Technical Report. (August 2017)

Other publications of authors with the same name

Uncertainty and Performance of Adaptive Controllers for Functionally Uncertain Output Feedback Systems. M. French, Cs. Szepesvári, and E. Rogers. CDC, pages 4515--4520. Tampa, Florida, IEEE, (December 1998)

Integration of ANN Cues, Dynamic AI Concepts and ANN Decision System into an Adaptive Self-Organizing Agent. Cs. Szepesvári and A. Lörincz. 3rd Conf. on Artificial Intelligence, pages 231--237. Budapest, Hungary, John von Neumann Society for Computer Sciences, (April 1993)

Parallel and Robust Skeletonization Built on Self-organizing Elements. Zs. Kalmár, Zs. Marczell, Cs. Szepesvári, and A. Lörincz. Neural Networks, (1999)

Regularized Fitted Q-iteration for Planning in Continuous-Space Markovian Decision Problems. A. Farahmand, M. Ghavamzadeh, Cs. Szepesvári, and S. Mannor. ACC, pages 725--730. (2009)

The Online Loop-free Stochastic Shortest-Path Problem. G. Neu, A. György, and Cs. Szepesvári. COLT, pages 231--243. (June 2010)

Empirical Bernstein stopping. V. Mnih, Cs. Szepesvári, and J. Audibert. ICML, pages 672--679. (2008)

REGO: Rank-based Estimation of Rényi Information using Euclidean Graph Optimization. B. Póczos, S. Kirshner, and Cs. Szepesvári. AISTATS, 9, pages 852--859. (May 2010)

Budgeted Distribution Learning of Belief Net Parameters. L. Li, B. Póczos, R. Greiner, and Cs. Szepesvári. ICML, pages 879--886. Omnipress, (June 2010)

Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms. S. Singh, T. Jaakkola, M. Littman, and Cs. Szepesvári. Machine Learning, 38 (3): 287--308 (2000)

Learning Near-optimal Policies with Bellman-residual Minimization based Fitted Policy Iteration and a Single Sample Path. A. Antos, Cs. Szepesvári, and R. Munos. COLT, pages 574--588. (2006)
