Author of the publication

Decision Transformer: Reinforcement Learning via Sequence Modeling

arXiv:2106.01345 (2021). Comment: First two authors contributed equally. Last two authors advised equally.

Other publications of authors with the same name

AlignFlow: Learning from multiple domains via normalizing flows. DGS@ICLR, OpenReview.net, (2019)

Permutation Invariant Graph Generation via Score-Based Generative Modeling. AISTATS, volume 108 of Proceedings of Machine Learning Research, page 4474-4484. PMLR, (2020)

Online Decision Transformer. ICML, volume 162 of Proceedings of Machine Learning Research, page 27042-27059. PMLR, (2022)

Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL. ICLR, OpenReview.net, (2023)

Moser Flow: Divergence-based Generative Modeling on Manifolds. NeurIPS, page 17669-17680. (2021)

Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting. NeurIPS, page 11056-11068. (2019)

PiRank: Scalable Learning To Rank via Differentiable Sorting. NeurIPS, page 21644-21654. (2021)

Peering Through Preferences: Unraveling Feedback Acquisition for Aligning Large Language Models. CoRR, (2023)

Closed-loop optimization of fast-charging protocols for batteries with machine learning. Nat., 578 (7795): 397-402 (2020)

Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits. CoRR, (2021)