UTCS Artificial Intelligence
A Survey of Robotic Language Grounding: Tradeoffs Between Symbols and Embeddings (2024)
Vanya Cohen, Jason Xinyu Liu, Raymond Mooney, Stefanie Tellex, David Watkins
With large language models, robots can understand language more flexibly and capably than ever before. This survey reviews recent literature and situates it on a spectrum with two poles: 1) mapping between language and some manually defined formal representation of meaning, and 2) mapping between language and high-dimensional vector spaces that translate directly to low-level robot policy. Using a formal representation allows the meaning of the language to be precisely represented, limits the size of the learning problem, and leads to a framework for interpretability and formal safety guarantees. Methods that embed language and perceptual data into high-dimensional spaces avoid this manually specified symbolic structure and thus have the potential to be more general when fed enough data, but they require more data and compute to train. We discuss the benefits and trade-offs of each approach and finish by providing directions for future work that achieves the best of both worlds.
View: PDF, arXiv
Citation: International Joint Conference on Artificial Intelligence (IJCAI) (2024).
Bibtex:
@inproceedings{cohen:ijcai24,
  title={A Survey of Robotic Language Grounding: Tradeoffs Between Symbols and Embeddings},
  author={Vanya Cohen and Jason Xinyu Liu and Raymond Mooney and Stefanie Tellex and David Watkins},
  booktitle={International Joint Conference on Artificial Intelligence (IJCAI)},
  month={August},
  url="http://www.cs.utexas.edu/users/ai-labpub-view.php?PubID=128058",
  year={2024}
}
Presentation: Slides (PDF), Poster
People
Vanya Cohen
Ph.D. Student
vanya [at] utexas edu
Raymond J. Mooney
Faculty
mooney [at] cs utexas edu
Areas of Interest
Language and Robotics
Labs
Machine Learning