Reinforcement Learning Algorithm for Partially Observable Markov Problems
Reinforcement Learning Algorithm for Partially Observable Markov Problems
Tommi Jaakkola, Satinder P. Singh, and Michael I. Jordan, 1995
Download
Abstract
(unavailable)
BibTeX Entry
@InProceedings{Jaakkola+SJ:1995, author = "Jaakkola, Tommi and Singh, Satinder P. and Jordan, Michael I.", title = "Reinforcement Learning Algorithm for Partially Observable Markov Problems", booktitle = "Advances in Neural Information Processing Systems 7 (NIPS 1994)", year = "1995", editor = "Tesauro, Gerald and Touretzky, David S. and Leen, Todd K.", publisher = "MIT Press", address = "Cambridge, MA, USA", pages = "345--352", url = "http://www.eecs.umich.edu/~baveja/Papers/Nips94b.pdf", bib2html_rescat = "Partial Observability", }