Peter Stone's Selected Publications

Classified by TopicClassified by Publication TypeSorted by DateSorted by First Author Last NameClassified by Funding Source


Function Approximation via Tile Coding: Automating Parameter Choice

Function Approximation via Tile Coding: Automating Parameter Choice.
Alexander A. Sherstov and Peter Stone.
In J.-D. Zucker and I. Saitta, editors, SARA 2005, Lecture Notes in Artificial Intelligence, pp. 194–205, Springer Verlag, Berlin, 2005.
SARA-05.
Official version from Publisher's Webpage© Springer-Verlag

Download

[PDF]187.7kB  [postscript]583.4kB  [slides.pdf]193.7kB  

Abstract

Reinforcement learning (RL) is a powerful abstraction of sequential decision making that has an established theoretical foundation and has proven effective in a variety of small, simulated domains. The success of RL on real-world problems with large, often continuous state and action spaces hinges on effective function approximation. Of the many function approximation schemes proposed, tile coding strikes an empirically successful balance among representational power, computational cost, and ease of use and has been widely adopted in recent RL work. This paper demonstrates that the performance of tile coding is quite sensitive to parameterization. We present detailed experiments that isolate the effects of parameter choices and provide guidance to their setting. We further illustrate that no single parameterization achieves the best performance throughout the learning curve, and contribute an automated technique for adjusting tile-coding parameters online. Our experimental findings confirm the superiority of adaptive parameterization to fixed settings. This work aims to automate the choice of approximation scheme not only on a problem basis but also throughout the learning process, eliminating the need for a substantial tuning effort.

BibTeX Entry

@incollection(SARA05,
        author="Alexander A.\ Sherstov and Peter Stone",
        title="Function Approximation via Tile Coding: Automating Parameter Choice",
        booktitle="SARA 2005",
        series="Lecture Notes in Artificial Intelligence",      
        editor="J.-D.\ Zucker and I.\ Saitta",
	volume="3607",
        Publisher="Springer Verlag",address="Berlin",year="2005",
	pages="194--205",
        abstract={
                  Reinforcement learning (RL) is a powerful
                  abstraction of sequential decision making that has
                  an established theoretical foundation and has proven
                  effective in a variety of small, simulated domains.
                  The success of RL on real-world problems with large,
                  often continuous state and action spaces hinges on
                  effective \emph{function approximation.} Of the many
                  function approximation schemes proposed, \emph{tile
                  coding} strikes an empirically successful balance
                  among representational power, computational cost,
                  and ease of use and has been widely adopted in
                  recent RL work. This paper demonstrates that the
                  performance of tile coding is quite sensitive to
                  parameterization. We present detailed experiments
                  that isolate the effects of parameter choices and
                  provide guidance to their setting. We further
                  illustrate that \emph{no single parameterization}
                  achieves the best performance throughout the
                  learning curve, and contribute an \emph{automated}
                  technique for adjusting tile-coding parameters
                  online. Our experimental findings confirm the
                  superiority of adaptive parameterization to fixed
                  settings. This work aims to automate the choice of
                  approximation scheme not only on a problem basis but
                  also throughout the learning process, eliminating
                  the need for a substantial tuning effort.
                 },
        wwwnote={<a href="http://sara2005.limbio-paris13.org/">SARA-05</a>.<br>
Official version from <a href="http://dx.doi.org/10.1007/11527862_14">Publisher's Webpage</a>&copy Springer-Verlag},
)

Generated by bib2html.pl (written by Patrick Riley ) on Tue Nov 19, 2024 10:24:40