UTCS Artificial Intelligence
courses
talks/events
demos
people
projects
publications
software/data
labs
areas
admin
Compositional Instruction Following with Language Models and Reinforcement Learning (2024)
Vanya Cohen
, Geraud Nangue Tasse, Nakul Gopalan, Steven James, Matthew Gombolay,
Raymond Mooney
, Benjamin Rosman
Combining reinforcement learning with language grounding is challenging as the agent needs to explore the environment while simultaneously learning multiple language-conditioned tasks. To address this, we introduce a novel method: the compositionally-enabled reinforcement learning language agent (CERLLA). Our method reduces the sample complexity of tasks specified with language by leveraging compositional policy representations and a semantic parser trained using reinforcement learning and in-context learning. We evaluate our approach in an environment requiring function approximation and demonstrate compositional generalization to novel tasks. Our method significantly outperforms the previous best non-compositional baseline in terms of sample complexity on 162 tasks designed to test compositional generalization. Our model attains a higher success rate and learns in fewer steps than the non-compositional baseline. It reaches a success rate equal to an oracle policy’s upper-bound performance of 92 percent. With the same number of environment steps, the baseline only reaches a success rate of 80 percent.
View:
PDF
,
Arxiv
Citation:
Transactions on Machine Learning Research
(2024).
Bibtex:
@article{cohen:tmlr24, title={Compositional Instruction Following with Language Models and Reinforcement Learning}, author={Vanya Cohen and Geraud Nangue Tasse and Nakul Gopalan and Steven James and Matthew Gombolay and Raymond Mooney and Benjamin Rosman}, booktitle={Transactions on Machine Learning Research}, month={December}, url="http://www.cs.utexas.edu/users/ai-labpub-view.php?PubID=128113", year={2024} }
People
Vanya Cohen
Ph.D. Student
vanya [at] utexas edu
Raymond J. Mooney
Faculty
mooney [at] cs utexas edu
Areas of Interest
Deep Learning
Language and Robotics
Reinforcement Learning
Labs
Machine Learning