UTCS Artificial Intelligence
courses
talks/events
demos
people
projects
publications
software/data
labs
areas
admin
Using Soft-Matching Mined Rules to Improve Information Extraction (2004)
Un Yong Nahm
and
Raymond J. Mooney
By discovering predictive relationships between different pieces of extracted data, data-mining algorithms can be used to improve the accuracy of information extraction. However, textual variation due to typos, abbreviations, and other sources can prevent the productive discovery and utilization of hard-matching rules. Recent methods for inducing
soft-matching
rules from extracted data can more effectively find and exploit predictive relationships in textual data. This paper presents techniques for using mined soft-matching association rules to increase the accuracy of information extraction. Experimental results on a corpus of computer-science job postings demonstrate that soft-matching rules improve information extraction more effectively than hard-matching rules.
View:
PDF
,
PS
Citation:
In
Proceedings of the AAAI-2004 Workshop on Adaptive Text Extraction and Mining (ATEM-2004)
, pp. 27-32, San Jose, CA, July 2004.
Bibtex:
@inproceedings{nahm:atem04, title={Using Soft-Matching Mined Rules to Improve Information Extraction}, author={Un Yong Nahm and Raymond J. Mooney}, booktitle={Proceedings of the AAAI-2004 Workshop on Adaptive Text Extraction and Mining (ATEM-2004)}, month={July}, address={San Jose, CA}, pages={27-32}, url="http://www.cs.utexas.edu/users/ai-lab?nahm:atem04", year={2004} }
People
Raymond J. Mooney
Faculty
mooney [at] cs utexas edu
Un Yong Nahm
Ph.D. Alumni
pebronia [at] acm org
Areas of Interest
Information Extraction
Machine Learning
Text Data Mining
Labs
Machine Learning