UTCS Artificial Intelligence
courses
talks/events
demos
people
projects
publications
software/data
labs
areas
admin
Guiding Interaction Behaviors for Multi-modal Grounded Language Learning (2017)
Jesse Thomason
, Jivko Sinapov, and
Raymond J. Mooney
Multi-modal grounded language learning connects language predicates to physical properties of objects in the world. Sensing with multiple modalities, such as audio, haptics, and visual colors and shapes while performing interaction behaviors like lifting, dropping, and looking on objects enables a robot to ground non-visual predicates like "empty" as well as visual predicates like "red". Previous work has established that grounding in multi-modal space improves performance on object retrieval from human descriptions. In this work, we gather behavior annotations from humans and demonstrate that these improve language grounding performance by allowing a system to focus on relevant behaviors for words like "white" or "half-full" that can be understood by looking or lifting, respectively. We also explore adding modality annotations (whether to focus on audio or haptics when performing a behavior), which improves performance, and sharing information between linguistically related predicates (if "green" is a color, "white" is a color), which improves grounding recall but at the cost of precision.
View:
PDF
Citation:
In
Proceedings of the Workshop on Language Grounding for Robotics at ACL 2017 (RoboNLP-17)
, Vancouver, Canada, August 2017.
Bibtex:
@inproceedings{thomason:robonlp17,, title={Guiding Interaction Behaviors for Multi-modal Grounded Language Learning}, author={Jesse Thomason and Jivko Sinapov and Raymond J. Mooney}, booktitle={Proceedings of the Workshop on Language Grounding for Robotics at ACL 2017 (RoboNLP-17)}, month={August}, address={Vancouver, Canada}, url="http://www.cs.utexas.edu/users/ai-labpub-view.php?PubID=127642", year={2017} }
People
Raymond J. Mooney
Faculty
mooney [at] cs utexas edu
Jesse Thomason
Ph.D. Alumni
thomason DOT jesse AT gmail
Areas of Interest
Language and Robotics
Natural Language Processing
Labs
Machine Learning