Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning
Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning
Richard S. Sutton, Doina Precup, and Satinder P. Singh, 1999
Download
Abstract
(unavailable)
BibTeX Entry
@Article{Sutton+PS:1999, author = "Sutton, Richard S. and Precup, Doina and Singh, Satinder P.", title = "Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning", journal = "Artificial Intelligence", year = "1999", volume = "112", number = "1--2", pages = "181--211", url = "http://webdocs.cs.ualberta.ca/~sutton/papers/SPS-aij.pdf", bib2html_rescat = "Representation", }