UTCS Artificial Intelligence
courses
talks/events
demos
people
projects
publications
software/data
labs
areas
admin
Using Natural Language for Reward Shaping in Reinforcement Learning (2019)
Prasoon Goyal
,
Scott Niekum
,
Raymond J. Mooney
Recent reinforcement learning (RL) approaches have shown strong performance in complex domains such as Atari games, but are often highly sample inefficient. A common approach to reduce interaction time with the environment is to use reward shaping, which involves carefully designing reward functions that provide the agent intermediate rewards for progress towards the goal. However, designing appropriate shaping rewards is known to be difficult as well as time-consuming. In this work, we address this problem by using natural language instructions to perform reward shaping. We propose the LanguagE-Action Reward Network (LEARN), a framework that maps free-form natural language instructions to intermediate rewards based on actions taken by the agent. These intermediate language-based rewards can seamlessly be integrated into any standard reinforcement learning algorithm. We experiment with Montezuma’s Revenge from the Atari Learning Environment, a popular benchmark in RL. Our experiments on a diverse set of 15 tasks demonstrate that, for the same number of interactions with the environment, language-based rewards lead to successful completion of the task 60 % more often on average, compared to learning without language.
View:
PDF
Citation:
In
Proceedings of the 28th International Joint Conference on Artificial Intelligence
, Macao, China, August 2019.
Bibtex:
@inproceedings{goyal:ijcai19, title={Using Natural Language for Reward Shaping in Reinforcement Learning}, author={Prasoon Goyal and Scott Niekum and Raymond J. Mooney}, booktitle={Proceedings of the 28th International Joint Conference on Artificial Intelligence}, month={August}, address={Macao, China}, url="http://www.cs.utexas.edu/users/ai-labpub-view.php?PubID=127757", year={2019} }
Presentation:
Slides (PDF)
Poster
People
Prasoon Goyal
Ph.D. Alumni
pgoyal [at] cs utexas edu
Prasoon Goyal
Ph.D. Alumni
pgoyal [at] cs utexas edu
Raymond J. Mooney
Faculty
mooney [at] cs utexas edu
Scott Niekum
Faculty
sniekum [at] cs utexas edu
Areas of Interest
Deep Learning
Language and Robotics
Natural Language Processing
Reinforcement Learning
Labs
Machine Learning
The Personal Autonomous Robotics Lab (PeARL)