UTCS Artificial Intelligence
courses
talks/events
demos
people
projects
publications
software/data
labs
areas
admin
Fast Online Lexicon Learning for Grounded Language Acquisition (2012)
David L. Chen
Learning a semantic lexicon is often an important first step in building a system that learns to interpret the meaning of natural language. It is especially important in language grounding where the training data usually consist of language paired with an ambiguous perceptual context. Recent work by Chen and Mooney (2011) introduced a lexicon learning method that deals with ambiguous relational data by taking intersections of graphs. While the algorithm produced good lexicons for the task of learning to interpret navigation instructions, it only works in batch settings and does not scale well to large datasets. In this paper we introduce a new online algorithm that is an order of magnitude faster and surpasses the state-of-the-art results. We show that by changing the grammar of the formal meaning representation language and training on additional data collected from Amazon's Mechanical Turk we can further improve the results. We also include experimental results on a Chinese translation of the training data to demonstrate the generality of our approach.
View:
PDF
Citation:
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL-2012)
(2012), pp. 430--439.
Bibtex:
@article{chen:acl2012, title={Fast Online Lexicon Learning for Grounded Language Acquisition}, author={David L. Chen}, booktitle={Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL-2012)}, month={July}, pages={430--439}, url="http://www.cs.utexas.edu/users/ai-lab?chen:acl2012", year={2012} }
Presentation:
Slides (PPT)
People
David Chen
Ph.D. Alumni
cooldc [at] hotmail com
Areas of Interest
Language and Robotics
Machine Learning
Natural Language Processing
Demos
Learning to Interpret Natural Language Navigation Instructions from Observations
David L. Chen
2012
Labs
Machine Learning